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COMPOSITIONS AND METHODS 
RELATING TO LUNG SPECIFIC GENES 



INTRODUCTIOl 

This application claims the benefit of priority from 

5 U.S. Provisional Application Serial No. 60/219,834, filed 

July 21, 2000,^ which is herein incorporated in its 
A 

entirety . 

FIELD OF THE INVENTION 

The present invention relates to newly identified 

10 nucleic acids and polypeptides present in normal and 

neoplastic lung cells, including fragments, variants and 
derivatives of the nucleic acids and polypeptides. The 
present invention also relates to antibodies to the 
polypeptides of the invention, as well as agonists and 

15 antagonists of the polypeptides of the invention. The 
invention also relates to compositions comprising the 
nucleic acids, polypeptides, antibodies, variants, 
derivatives, agonists and antagonists of the invention and 
methods for the use of these compositions. These uses 

20 include identifying, diagnosing, monitoring, staging, 

imaging and treating lung cancer and non-cancerous disease 
states in lung, identifying lung tissue, monitoring and 
modifying lung embryonic development and differentiation, 
and identifying and/or designing agonists and antagonists 

25 of polypeptides of the invention. The uses also include 
gene therapy, production of transgenic animals and cells, 
and production of engineered lung tissue for treatment and 
research . 
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BACKGROUND OF THE INVENTION 

Throughout the last hundred years, the incidence of 
lung cancer has steadily increased, so much so that now in 
many countries, it is the most common cancers. In fact, 
5 lung cancer is the second most prevalent type of cancer for 
both men and women in the United States and is the most 
common cause of cancer death in both sexes . Lung cancer 
deaths have increased ten- fold in both men and women since 
1930, primarily due to an increase in cigarette smoking, 

10 but also due to an increased exposure to arsenic, asbestos, 
chromates, chloromethyl ethers, nickel, polycyclic aromatic 
hydrocarbons and other agents. See Scott, Lung Cancer: A 
Guide to Diagnosis and Treatment, Addicus Books (2000) and 
Alberg et al . , in Kane et al . (eds.) Biology of Lung 

15 Cancer, pp. 11-52, Marcel Dekker, Inc. (1998) . Lung cancer 
may result from a primary tumor originating in the lung or 
a secondary tumor which has spread from another organ such 
as the bowel or breast . Although there are over a dozen 
types of lung cancer, over 90% fall into two categories: 

20 small cell lung cancer (SCLC) and non-small cell lung 
cancer (NSCLC) . See Scott, supra. About 20-25% of all 
lung cancers are characterized as SCLC, while 70-80% are 
diagnosed as NSCLC. Id. A rare type of lung cancer is 
mesothelioma, which is generally caused by exposure to 

25 asbestos, and which affects the pleura of the lung. Lung 
cancer is usually diagnosed or screened for by chest x-ray, 
CAT scans, PET scans, or by sputum cytology. A diagnosis 
of lung cancer is usually confirmed by biopsy of the 
tissue. Id. 

30 SCLC tumors are highly metastatic and grow 

quickly. By the time a patient has been diagnosed with 
SCLC, the cancer has usually already spread to other parts 
of the body, including lymph nodes, adrenals, liver, bone, 
brain and bone marrow. See Scott, supra; Van Houtte et al . 

35 (eds.), Progress and Perspective in the Treatment of Lung 
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Cancer, Springer-Verlag (1999) . Because the disease has 
usually spread to such an extent that surgery is not an 
option, the current treatment of choice is chemotherapy 
plus chest irradiation. See Van Houtte, supra. The stage 
5 of disease is a principal predictor of long-term survival. 
Less than 5% of patients with extensive disease that has 
spread beyond one lung and surrounding lymph nodes, live 
longer than two years. Id. However, the probability of 
five-year survival is three to four times higher if the 
10 disease is diagnosed and treated when it is still in a 
limited stage, i.e., not having spread beyond one lung, 
hi Id. 

y3 NSCLC is generally divided into three types: 

J5 squamous cell carcinoma, adenocarcinoma and large cell 

Lfl 15 carcinoma. Both squamous cell cancer and adenocarcinoma 

si develop from the cells that line the airways; however, 

^ adenocarcinoma develops from the goblet cells that produce 

mucus. Large cell lung cancer has been thus named because 
yJ the cells look large and rounded when viewed 

2 20 microscopically, and generally are considered relatively 

^ undifferentiated. See Yesner, Atlas of Lung Cancer, 

Lippincott-Raven (1998) . 

Secondary lung cancer is a cancer initiated elsewhere 
in the body that has spread to the lungs. Cancers that 
25 metastasize to the lung include, but are not limited to, 
breast cancer, melanoma, colon cancer and Hodgkin' s 
lymphoma. Treatment for secondary lung cancer may depend 
upon the source of the original cancer. In other words, a 
lung cancer that originated from breast cancer may be more 
3 0 responsive to breast cancer treatments and a lung cancer 
that originated from the colon cancer may be more 
responsive to colon cancer treatments. 

The stage of a cancer indicates how far it has spread 
and is an important indicator of the prognosis. In 
35 addition, staging is important because treatment is often 
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decided according to the stage of a cancer. SCLC is 
divided into two stages: limited disease, i.e., cancer 
that can only be seen in one lung and in nearby lymph 
nodes; and extensive disease, i.e., cancer that has spread 
outside the lung to the chest or to other parts of the 
body. For most patients with SCLC, the disease has already 
progressed to lymph nodes or elsewhere in the body at the 
time of diagnosis. See Scott, supra. Even if spreading is 
not apparent on the scans, it is likely that some cancer 
cells may have spread away and traveled through the 
bloodstream or lymph system. In general, chemotherapy with 
or without radiotherapy is often the preferred treatment. 
The initial scans and tests done at first will be used 
later to see how well a patient is responding to treatment. 

In contrast, non- small cell cancer may be divided 
into four stages. Stage I is highly localized cancer with 
no cancer in the lymph nodes. Stage II cancer has spread 
to the lymph nodes at the top of the affected lung. Stage 
III cancer has spread near to where the cancer started. 
This can be to the chest wall, the covering of the lung 
(pleura) , the middle of the chest (mediastinum) or other 
lymph nodes . Stage IV cancer has spread to another part of 
the body. Stage I-III cancer is usually treated with 
surgery, with or without chemotherapy. Stage IV cancer is 
usually treated with chemotherapy and/or palliative care. 

A number of chromosomal and genetic abnormalities 
have been observed in lung cancer. In NSCLC, chromosomal 
aberrations have been described on 3p, 9p, lip, 15p and 
17p, and chromosomal deletions have been seen on 
chromosomes 7, 11, 13 and 19. See Skarin (ed.), 
Multimodality Treatment of Lung Cancer, Marcel Dekker, Inc. 
(2000); Gemmill et al . , pp. 465-502, in Kane, supra; 
Bailey-Wilson et al . , pp. 53-98, in Kane, supra. 
Chromosomal abnormalities have been described on lp, 3p, 
5q, 6q, 8q, 13q and 17p in SCLC. Id. In addition, the 
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loss of the short arm of chromosome 3p has also been seen 
in greater than 90% of SCLC tumors and approximately 50% of 

NSCLC tumors. Id. 

A number of oncogenes and tumor suppressor genes have 

5 been implicated in lung cancer. See Mabry, pp. 391-412, in 
Kane, supra and Sclafani et al . , pp. 295-316, in Kane, 
supra. In both SCLC and NSCLC, the p53 tumor suppressor 
gene is mutated in over 50% of lung cancers. See Yesner, 
supra. Another tumor suppressor gene, FHIT, which is found 

10 on chromosome 3p, is mutated by tobacco smoke. Id.; 

Skarin, supra. In addition, more than 95% of SCLCs and 
approximately 20-60% of NSCLCs have an absent or abnormal 
retinoblastoma (Rb) protein, another tumor suppressor gene. 
The ras oncogene (particularly K-ras) is mutated in 20-30% 

15 of NSCLC specimens and the c-erbB2 oncogene is expressed in 
18% of stage 2 NSCLC and 60% of stage 4 NSCLC specimens. 
See Van Houtte, supra. Other tumor suppressor genes that 
are found in a region of chromosome 9, specifically in the 
region of 9p21, are deleted in many cancer cells, including 

20 pl6 rwK4A and p!5 INK4B . See Bailey-Wilson, supra; Sclafani et 
al., supra. These tumor suppressor genes may also be 
implicated in lung cancer pathogenesis. 

In addition, many lung cancer cells produce growth 
factors that may act in an autocrine fashion on lung cancer 

25 cells. See Siegfried et al . , pp. 317-336, in Kane, supra; 
Moody, pp. 337-370, in Kane, supra and Heasley et al . , 371- 
3 90, in Kane, supra. In SCLC, many tumor cells produce 
gastrin-releasing peptide (GRP) , which is a proliferative 
growth factor for these cells. See Skarin, supra. Many 

3 0 NSCLC tumors express epidermal growth factor (EGF) 

receptors, allowing NSCLC cells to proliferate in response 
to EGF. Insulin-like growth factor (IGF-I) is elevated in 
greater than 95% of SCLC and greater than 80% of NSCLC 
tumors; it is thought to function as an autocrine growth 



DEX-0214 - 6 - PATENT 

factor. Id. Finally, stem cell factor (SCF, also known as 
steel factor or kit ligand) and c-Kit (a proto-oncoprotein 
tyrosine kinase receptor for SCF) are both expressed at 
high levels in SCLC, and thus may form an autocrine loop 
5 that increases proliferation. Id. 

Although the majority of lung cancer cases are 
attributable to cigarette smoking, most smokers do not 
develop lung cancer. Epidemiological evidence has 
suggested that susceptibility to lung cancer may be 
10 inherited in a Mendel ian fashion, and thus have an 

inherited genetic component. Bailey-Wilson, supra. Thus, 
it is thought that certain allelic variants at some genetic 
loci may affect susceptibility to lung cancer. Id. One way 
to identify which allelic variants are likely to be 
15 involved in lung cancer susceptibility, as well as 

susceptibility to other diseases, is to look at allelic 
variants of genes that are highly expressed in lung. 

The lung is also susceptible to a number of other 
debilitating diseases, including, without limitation, 
20 emphysema, pneumonia, cystic fibrosis and asthma. See 
Stockley (ed.), Molecular Biology of the Lung, Volume I: 
Emphysema and Infection, Birkhauser Verlag (1999) , 
hereafter Stockley I, and Stockley (ed.), Molecular Biology 
of the Lung, Volume II: Asthma and Cancer, Birkhauser 
25 Verlag (1999), hereafter Stockley II. The cause of many 
these disorders is still not well understood and there are 
few, if any, good treatment options for many of these 
noncancerous lung disorders. Thus, there remains a need to 
understand various noncancerous lung disorders and to 
30 identify treatments for these diseases. 

In yet another aspect, the development and 
differentiation of the lung tissue is important during 
embryonic development. All of the epithelial cells of the 
respiratory tract, including those of the lung and bronchi, 
3 5 are derived from the primitive endodermal cells that line 
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the embryonic outpouching. -See Yesner, supra. During 
embryonic development, multipotent endodermal stem cells 
differentiate into many different types of specialized 
cells, which include ciliated cells for moving inhaled 
5 particles, goblet cells for producing mucus, Kulchitsky' s 
cells for endocrine function, and Clara cells and type II 
pneumocytes for secreting surfactant protein. Id. 
Improper development and differentiation may cause 
respiratory disorders and distress in infants, particularly 
10 in premature infants, whose lungs cannot produce sufficient 
surfactant when they are born. Further, some lung cancer 

^ cells, particularly small cell carcinomas, appear 

multipotent, and can spontaneously differentiate into a 

□ number of cell types, including small cell carcinoma, 

15 adenocarcinoma and squamous cell carcinoma. Id. Thus, a 

m better understanding of lung development and 

H differentiation may help facilitate understanding of lung 

Q cancer initiation and progression. 

^ Accordingly, there is a great need for more sensitive 

Q 20 and accurate methods for predicting whether a person is 

y likely to develop lung cancer, for diagnosing lung cancer, 

for monitoring the progression of the disease, for staging 
the lung cancer, for determining whether the lung cancer 
has metastasized and for imaging the lung cancer. There is 
25 also a need for better treatment of lung cancer. Further, 
there is also a great need for diagnosing and treating 
noncancerous lung disorders such as emphysema, pneumonia, 
lung infection, pulmonary fibrosis, cystic fibrosis and 
asthma. There is also a need for compositions and methods 
3 0 of using them that can be used to identify lung tissue for 
forensic purposes and for determining whether a particular 
cell or tissue exhibits lung- specif ic characteristics. 

In the present invention, methods are provided for 
detecting, diagnosing, monitoring, staging, 
35 prognosticating, imaging and treating lung cancer via lung 
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specific genes referred to herein as LSGs . For purposes of 
the present invention, LSG refers, among other things, to 
native protein expressed by the gene comprising a 
polynucleotide sequence of SEQ IDN0:1, 2, 3, 4, 5, 6, 7, 
5 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 22 
or a contig of SEQ ID NO: 19 or 21 as depicted in SEQ ID 
NO: 37, or 38, respectively. By "LSG" it is also meant 
herein polynucleotides which, due to degeneracy in genetic 
coding, comprise variations in nucleotide sequence as 
10 compared to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37, 38, 39 or 
40 but which still encode the same polypeptide. Exemplary 
amino acid sequences for LSG polypeptides are set forth in 
SEQ ID NO: 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 
15 51, 52, 53, 54, 55 and 56. In the alternative, what is 

meant by LSG as used herein, means the native mRNA encoded 
by the gene comprising the polynucleotide sequence of SEQ 
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17, 18, 19, 20, 21 or 22, levels of the gene comprising 
2 0 the polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 
22 or levels of a polynucleotide which is capable of 
hybridizing under stringent conditions to the antisense 
sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 
25 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37, or 38. 

Other objects, features, advantages and aspects of 
the present invention will become apparent to those of 
skill in the art from the following description. It should 
be understood, however, that the following description and 
30 the specific examples, while indicating preferred 
embodiments of the invention are given by way of 
illustration only. Various changes and modifications 
within the spirit and scope of the disclosed invention will 
become readily apparent to those skilled in the art from 
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reading the following description and from reading the 
other parts of the present disclosure. 

SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the 
5 present invention to provide LSGs comprising a 

polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 

11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 a 
protein expressed by a polynucleotide of SEQ ID NO: 1, 2, 
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 

10 19, 20, 21, 22, 37 or 38 or a variant thereof which 

expresses the protein; or a polynucleotide which is capable 
of hybridizing under stringent conditions to the antisense 
sequence of SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 

12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
15 Exemplary LSG polypeptides of the present invention are 

depicted in SEQ DI NO: 39, 40, 41, 42, 43, 44, 45, 46, 47, 
48, 49, 50, 51, 52, 53, 54, 55 or 56. 

It is another object of the present invention to 
provide a method for diagnosing the presence of lung cancer 
20 by analyzing for changes in levels of LSG in cells, tissues 
or bodily fluids compared with levels of LSG in preferably 
the same cells, tissues, or bodily fluid type of a normal 
human control, wherein a change in levels of LSG in the 
patient versus the normal human control is associated with 

2 5 lung cancer. 

Further provided is a method of diagnosing metastatic 
lung cancer in a patient having lung cancer which is not 
known to have metastasized by identifying a human patient 
suspected of having lung cancer that has metastasized; 

30 analyzing a sample of cells, tissues, or bodily fluid from 
such patient for LSG; comparing the LSG levels in such 
cells, tissues, or bodily fluid with levels of LSG in 
preferably the same cells, tissues, or bodily fluid type of 
a normal human control, wherein an increase in LSG levels 
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in the patient versus the normal human control is 
associated with lung cancer which has metastasized. 

Also provided by the invention is a method of staging 
lung cancer in a human which has such cancer by identifying 
5 a human patient having such cancer; analyzing a sample of 
cells, tissues, or bodily fluid from such patient for LSG; 
comparing LSG levels in such cells, tissues, or bodily 
fluid with levels of LSG in preferably the same cells, 
tissues, or bodily fluid type of a normal human control 
10 sample, wherein an increase in LSG levels in the patient 

versus the normal human control is associated with a cancer 
which is progressing and a decrease in the levels of LSG is 
5 associated with a cancer which is regressing or in 

remission . 

15 Further provided is a method of monitoring lung 

cancer in a human having such cancer for the onset of 
metastasis. The method comprises identifying a human 
patient having such cancer that is not known to have 
metastasized; periodically analyzing a sample of cells, 
20 tissues, or bodily fluid from such patient for LSG; 

comparing the LSG levels in such cells, tissue, or bodily 
fluid with levels of LSG in preferably the same cells, 
tissues, or bodily fluid type of a normal human control 
sample, wherein an increase in LSG levels in the patient 

2 5 versus the normal human control is associated with a cancer 
which has metastasized. 

Further provided is a method of monitoring the change 
in stage of lung cancer in a human having such cancer by 
looking at levels of LSG in a human having such cancer. 

3 0 The method comprises identifying a human patient having 
such cancer; periodically analyzing a sample of cells, 
tissues, or bodily fluid from such patient for LSG; 
comparing the LSG levels in such cells, tissue, or bodily 
fluid with levels of LSG in preferably the same cells, 

35 tissues, or bodily fluid type of a normal human control 
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sample, wherein an increase in LSG levels in the patient 
versus the normal human control is associated with a cancer 
which is progressing and a decrease in the levels of LSG is 
associated with a cancer which is regressing or in 

5 remission. 

Further provided are methods of designing new 
therapeutic agents targeted to a LSG for use in imaging and 
treating lung cancer. For example, in one embodiment, 
therapeutic agents such as antibodies targeted against LSG 

10 or fragments of such antibodies can be used to treat, 

detect or image localization of LSG in a patient for the 
purpose of detecting or diagnosing a disease or condition. 
In this embodiment, an increase in the amount of labeled 
antibody detected as compared to normal tissue would be 

15 indicative of tumor metastases or growth. Such antibodies 
can be polyclonal, monoclonal, or omniclonal or prepared by 
molecular biology techniques. The term "antibody", as used 
herein and throughout the instant specification is also 
meant to include aptamers and single- stranded 

2 0 oligonucleotides such as those derived from an in vitro 
evolution protocol referred to as SELEX and well known to 
those skilled in the art. Antibodies can be labeled with a 
variety of detectable and therapeutic labels including, but 
not limited to, radioisotopes and paramagnetic metals. 

2 5 Therapeutic agents such as small molecules and antibodies 

which decrease the concentration and/or activity of LSG can 
also be used in the treatment of diseases characterized by 
overexpression of LSG. Such agents can be readily 
identified in accordance with teachings herein. 

3 0 Other objects, features, advantages and aspects of 

the present invention will become apparent to those of 
skill in the art from the following description. It should 
be understood, however, that the following description and 
the specific examples, while indicating preferred 
3 5 embodiments of the invention, are given by way of 
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illustration only. Various changes and modifications 
within the spirit and scope of the disclosed invention will 
become readily apparent to those skilled in the art from 
reading the following description and from reading the 
5 other parts of the present disclosure. 

GLOSSARY 

The following illustrative explanations are provided 
to facilitate understanding of certain terms used 
frequently herein, particularly in the examples. The 
10 explanations are provided as a convenience and are not 
limitative of the invention. 
<£) ISOLATED means altered "by the hand of man" from its 

n natural state; i.e., that, if it occurs in nature, it has 

*° been changed or removed from its original environment, or 

\t\ 

m 15 both. 

N For example, a naturally occurring polynucleotide or 

Q a polypeptide naturally present in a living animal in its 

y natural state is not "isolated," but the same 

□ polynucleotide or polypeptide separated from the coexisting 

P 20 materials of its natural state is "isolated", as the term 

is employed herein. For example, with respect to 
polynucleotides, the term isolated means that it is 
separated from the chromosome and cell in which it 
naturally occurs. 
25 As part of or following isolation, such 

polynucleotides can be joined to other polynucleotides, 
such as DNAs , for mutagenesis, to form fusion proteins, and 
for propagation or expression in a host, for instance. The 
isolated polynucleotides, alone or joined to other 
30 polynucleotides such as vectors, can be introduced into 
host cells, in culture or in whole organisms. When 
introduced into host cells in culture or in whole 
organisms, such DNAs still would be isolated, as the term 
is used herein, because they would not be in their 
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naturally occurring form or environment. Similarly, the 
polynucleotides and polypeptides may occur in a 
composition, such as media formulations, solutions for 
introduction of polynucleotides or polypeptides, for 
5 example, into cells, compositions or solutions for chemical 
or enzymatic reactions, for instance, which are not 
naturally occurring compositions, and, therein remain 
isolated polynucleotides or polypeptides within the meaning 
of that term as it is employed herein. 

10 OLIGONUCLEOTIDE (S) refers to relatively short 

polynucleotides. Often the term refers to single- stranded 
deoxyribonucleotides, but it can refer as well to single-or 
double- stranded ribonucleotides, RNA : DNA hybrids and 
double -stranded DNAs , among others. 

15 Oligonucleotides, such as single- stranded DNA probe 

oligonucleotides, often are synthesized by chemical 
methods, such as those implemented on automated 
oligonucleotide synthesizers. However, oligonucleotides 
can be made by a variety of other methods, including in 

2 0 vitro recombinant DNA-mediated techniques and by expression 
of DNAs in cells and organisms. 

Initially, chemically synthesized DNAs typically are 
obtained without a 5' phosphate. The 5' ends of such 
oligonucleotides are not substrates for phosphodiester bond 

25 formation by ligation reactions that employ DNA ligases 
typically used to form recombinant DNA molecules. Where 
ligation of such oligonucleotides is desired, a phosphate 
can be added by standard techniques, such as those that 
employ a kinase and ATP. 

30 The 3' end of a chemically synthesized 

oligonucleotide generally has a free hydroxyl group and, in 
the presence of a ligase such as T4 DNA ligase, readily 
will form a phosphodiester bond with a 5' phosphate of 
another polynucleotide, such as another oligonucleotide. 

35 As is well known, this reaction can be prevented 
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selectively, where desired, by removing the 5' phosphates 
of the other polynucleotide (s) prior to ligation. 

POLYNUCLEOTIDE (S) generally refers to any 
polyribonucleotide or polydeoxribonucleotide and is 
5 inclusive of unmodified RNA or DNA as well as modified RNA 
or DNA. Thus, for instance, polynucleotides as used herein 
refers to, among other things, single- and double- stranded 
DNA, DNA that is a mixture of single- and double- stranded 
regions, single- and double- stranded RNA, and RNA that is 

10 mixture of single- and double -stranded regions, hybrid 
molecules comprising DNA and RNA that may be single- 
stranded or, more typically, double- stranded or a mixture 
of single- and double -stranded regions. In addition, 
polynucleotide, as used herein, refers to triple- stranded 

15 regions comprising RNA or DNA or both RNA and DNA. The 
strands in such regions may be from the same molecule or 
from different molecules. The regions may include all of 
one or more of the molecules, but more typically involve 
only a region of some of the molecules. One of the 

20 molecules of a triple-helical region often is an 
oligonucleotide . 

As used herein, the term polynucleotide is also 
inclusive of DNAs or RNAs as described above that contain 
one or more modified bases. Thus, DNAs or RNAs with 

25 backbones modified for stability or for other reasons are 
"polynucleotides" as that term is intended herein. 
Moreover, DNAs or RNAs comprising unusual bases, such as 
inosine, or modified bases, such as tritylated bases, to 
name just two examples, are polynucleotides as the term is 

30 used herein. 

It will be appreciated that a great variety of 
modifications have been made to DNA and RNA that serve many 
useful purposes known to those of skill in the art. The 
term polynucleotide as it is employed herein embraces such 

35 chemically, enzymat ically or metabolically modified forms 
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of polynucleotides, as well as chemical forms of DNA and 
RNA characteristic of viruses and cells, including simple 
and complex cells, inter alia. 

POLYPEPTIDES, as used herein, includes all 
5 polypeptides as described below. The basic structure of 
polypeptides is well known and has been described in 
innumerable textbooks and other publications in the art. 
In this context, the term is used herein to refer to any 
peptide or protein comprising two or more amino acids 

10 joined to each other in a linear chain by peptide bonds. 

As used herein, the term refers to both short chains, which 
also commonly are referred to in the art as peptides, 
oligopeptides and oligomers, for example, and to longer 
chains, which generally are referred to in the art as 

15 proteins, of which there are many types. It will be 

appreciated that polypeptides often contain amino acids 
other than the 2 0 amino acids commonly referred to as the 
20 naturally occurring amino acids, and that many amino 
acids, including the terminal amino acids, may be modified 

20 in a given polypeptide, either by natural processes such as 
processing and other post-translational modifications, or 
by chemical modification techniques which are well known to 
the art. Even the common modifications that occur 
naturally in polypeptides are too numerous to list 

25 exhaustively here, but they are well described in basic 
texts and in more detailed monographs, as well as in a 
voluminous research literature, and they are well known to 
those of skill in the art. 

Modifications which may be present in polypeptides of 

30 the present invention include, to name an illustrative few, 
acetylation, acylation, ADP-ribosylation, amidation, 
covalent attachment of flavin, covalent attachment of a 
heme moiety, covalent attachment of a nucleotide or 
nucleotide derivative, covalent attachment of a lipid or 
35 lipid derivative, covalent attachment of 
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phosphotidylinositol, cross-linking, cyclization, disulfide 
bond formation, demethylation, formation of covalent cross- 
links, formation of cystine, formation of pyroglutamate , 
formylation, gamma-carboxylation, glycosylation, GPI anchor 
5 formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylat ion, 
sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination . 

10 Such modifications are well known to those of skill 

and have been described in great detail in the scientific 
literature. Several particularly common modifications 
including, but not limited to, glycosylation, lipid 
attachment, sulfation, gamma-carboxylation of glutamic acid 

15 residues, hydroxylation and ADP-ribosylation are described 
in most basic texts, such as, for instance PROTEINS 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. 
Creighton, W. H. Freeman and Company, New York (1993) . Many 
detailed reviews are available on this subject, such as, 

20 for example, those provided by Wold, F., Posttranslational 
Protein Modifications: Perspectives and Prospects, pgs . 1- 
12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, 
B. C. Johnson, Ed., Academic Press, New York (1983); 
Seifter et al . , Analysis for protein modifications and 

25 nonprotein cof actors, Meth. Enzymol . 182: 626-646 (1990) 
and Rattan et al . , Protein Synthesis: Posttranslational 
Modifications and Aging, Ann. N.Y. Acad. Sci . 663: 48-62 
(1992) . 

It will be appreciated that the polypeptides of the 
30 present invention are not always entirely linear. Instead, 
polypeptides may be branched as a result of ubiquitination, 
and they may be circular, with or without branching, 
generally as a result of posttranslation events including 
natural processing event and events brought about by human 
35 manipulation which do not occur naturally. Circular, 
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branched and branched circular polypeptides may be 
synthesized by non- translation natural processes and by 
entirely synthetic methods, as well. 

Modifications can occur anywhere in a polypeptide, 
5 including the peptide backbone, the amino acid side-chains 
and the amino or carboxyl termini. In fact, blockage of 
the amino and/or carboxyl group in a polypeptide by a 
covalent modification is common in naturally occurring and 
synthetic polypeptides and such modifications may be 
10 present in polypeptides of the present invention, as well. 
For instance, the amino terminal residue of polypeptides 
3 made in E. coli, prior to proteolytic processing, almost 

B invariably will be N-f ormylmethionine . 

S The modifications that occur in a polypeptide often 

J? 15 will be a function of how it is made. For polypeptides 

m made by expressing a cloned gene in a host, for instance, 

-4 the nature and extent of the modifications, in large part, 

□ will be determined by the host cell posttranslat ional 

Sj modification capacity and the modification signals present 

5 20 in the polypeptide amino acid sequence. For instance, as 

C3 is W ell known, glycosylation often does not occur in 

bacterial hosts such as E. coli. Accordingly, when 
glycosylation is desired, a polypeptide can be expressed in 
a glycosylating host, generally a eukaryotic cell. Insect 
25 cells often carry out the same posttranslat ional 

glycosylations as mammalian cells. Thus, insect cell 
expression systems have been developed to express 
efficiently mammalian proteins having native patterns of 
glycosylation, inter alia. Similar considerations apply to 
30 other modifications. 

It will be appreciated that the same type of 
modification may be present in the same or varying degrees 
at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. 
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In general, as used herein, the term polypeptide 
encompasses all such modifications, particularly those that 
are present in polypeptides synthesized by expressing a 
polynucleotide in a host cell. 
5 VARIANT (S) of polynucleotides or polypeptides, as the 

term is used herein, are polynucleotides or polypeptides 
that differ from a reference polynucleotide or polypeptide, 
respectively . 

With respect to variant polynucleotides, differences 

10 are generally limited so that the nucleotide sequences of 
the reference and the variant are closely similar overall 
and, in many regions, identical. Thus, changes in the 
nucleotide sequence of the variant may be silent. That is, 
they may not alter the amino acids encoded by the 

15 polynucleotide. Where alterations are limited to silent 
changes of this type a variant will encode a polypeptide 
with the same amino acid sequence as the reference. 
Alternatively, changes in the nucleotide sequence of the 
variant may alter the amino acid sequence of a polypeptide 

20 encoded by the reference polynucleotide. Such nucleotide 
changes may result in amino acid substitutions, additions, 
deletions, fusions and truncations in the polypeptide 
encoded by the reference sequence. 

With respect to variant polypeptides, differences are 

25 generally limited so that the sequences of the reference 
and the variant are closely similar overall and, in many 
region, identical. For example, a variant and reference 
polypeptide may differ in amino acid sequence by one or 
more substitutions, additions, deletions, fusions and 

3 0 truncations, which may be present in any combination. 

RECEPTOR MOLECULE, as used herein, refers to 
molecules which bind or interact specifically with LSG 
polypeptides of the present invention and is inclusive not 
only of classic receptors, which are preferred, but also 
35 other molecules that specifically bind to or interact with 
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polypeptides of the invention (which also may be referred 
to as "binding molecules" and "interaction molecules," 
respectively and as "LSG binding or interaction molecules" . 
Binding between polypeptides of the invention and such 
molecules, including receptor or binding or interaction 
molecules may be exclusive to polypeptides of the 
invention, which is very highly preferred, or it may be 
highly specific for polypeptides of the invention, which is 
highly preferred, or it may be highly specific to a group 
of proteins that includes polypeptides of the invention, 
which is preferred, or it may be specific to several groups 
of proteins at least one of which includes polypeptides of 
the invention . 

Receptors also may be non-naturally occurring, such 
as antibodies and antibody-derived reagents that bind to 
polypeptides of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to novel lung specific 
polypeptides and polynucleotides, referred to herein as 
0 LSGs, among other things, as described in greater detail 
below . 

Polynucleotides 

In accordance with one aspect of the present 
invention, there are provided isolated LSG polynucleotides 
5 which encode LSG polypeptides. 

Using the information provided herein, such as the 
polynucleotide sequences set out in SEQ ID NO : 1 , 2, 3, 4, 
5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, 37 or 38 a polynucleotide of the present invention 
0 encoding a LSG may be obtained using standard cloning and 
screening procedures, such as those for cloning cDNAs using 
mRNA from cells of a human tumor as starting material. 

Polynucleotides of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, 
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including, for instance, cDNA and genomic DNA obtained by 
cloning or produced by chemical synthetic techniques or by 
a combination thereof. The DNA may be double- stranded or 
single-stranded. Single- stranded DNA may be the coding 
5 strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense 
strand . 

The coding sequence which encodes the polypeptides 
may be identical to the coding sequence of the 

10 polynucleotides of SEQ IDNO:l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
It also may be a polynucleotide with a different sequence, 
which, as a result of the redundancy (degeneracy) of the 
genetic code, encodes the same polypeptides as encoded by 

15 SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 

15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 

Polynucleotides of the present invention, such as SEQ 
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 

16, 17, 18, 19, 20, 21, 22, 37 or 38 which encode these 
20 polypeptides may comprise the coding sequence for the 

mature polypeptide by itself. Polynucleotides of the 
present invention may also comprise the coding sequence for 
the mature polypeptide and additional coding sequences such 
as those encoding a leader or secretory sequence such as a 

25 pre-, or pro- or prepro-protein sequence. Polynucleotides 
of the present invention may also comprise the coding 
sequence of the mature polypeptide, with or without the 
aforementioned additional coding sequences, together with 
additional, non- coding sequences. Examples of additional 

30 non-coding sequences which may be incorporated into the 
polynucleotide of the present invention include, but are 
not limited to, introns and non-coding 5' and 3' sequences 
such as transcribed, non- translated sequences that play a 
role in transcription, mRNA processing including, for 

35 example, splicing and polyadenylat ion signals, ribosome 
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binding and stability of mRNA, and additional coding 
sequence which codes for amino acids such as those which 
provide additional functionalities. Thus, for instance, 
the polypeptide may be fused to a marker sequence such as a 
5 peptide which facilitates purification of the fused 
polypeptide. In certain preferred embodiments of this 
aspect of the invention, the marker sequence is a hexa- 
histidine peptide, such as the tag provided in the pQE 
vector (Qiagen, Inc.), among others, many of which are 
10 commercially available. As described in Gentz et al . (Proc. 
Natl. Acad. Sci . , USA 86: 821-824 (1989)), for instance, 
O hexa-histidine provides for convenient purification of the 

5 fusion protein. The HA tag corresponds to an epitope 

Q derived of influenza hemagglutinin protein (Wilson et al . , 

IH 15 Cell 37: 767 (1984)). 

F! In accordance with the foregoing, the term 

s " "polynucleotide encoding a polypeptide" as used herein 

™! encompasses polynucleotides which include a sequence 

fll encoding a polypeptide of the present invention, 

5S 20 particularly SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 

p 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 

Exemplary polypeptides encoded by the polynucleotides are 
depicted in SEQ ID NO: 39, 40, 41, 42, 43, 44, 45, 46, 47, 
48, 49, 50, 51, 52, 53, 54, 55, or 56. The term 
25 encompasses polynucleotides that include a single 

continuous region or discontinuous regions encoding the 
polypeptide (for example, interrupted by introns) together 
with additional regions, that also may contain coding 
and/or non-coding sequences. 
3 0 The present invention further relates to variants of 

the herein above described polynucleotides which encode for 
fragments, analogs and derivatives of the LSG polypeptides. 
A variant of the polynucleotide may be a naturally 
occurring variant such as a naturally occurring allelic 
35 variant, or it may be a variant that is not known to occur 
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naturally. Such non-naturally occurring variants of the 
polynucleotide may be made by mutagenesis techniques, 
including those applied to polynucleotides, cells or 
organisms . 

5 Among variants in this regard are variants that 

differ from the aforementioned polynucleotides by 
nucleotide substitutions, deletions or additions. The 
substitutions, deletions or additions may involve one or 
more nucleotides. The variants may be altered in coding or 
10 non-coding regions or both. Alterations in the coding 

regions may produce conservative or non-conservative amino 
acid substitutions, deletions or additions. 

Among the particularly preferred embodiments of the 
invention in this regard are polynucleotides encoding 
15 polypeptides having the same amino acid sequence encoded by 
a LSG polynucleotide comprising SEQ ID NO: 1, 2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 37 or 38; variants, analogs, derivatives and fragments 
thereof, and fragments of the variants, analogs and 
20 derivatives. Exemplary polypeptides encoded by these 

polynucleotides are depicted in SEQ ID NO: 39, 40, 41, 42, 
43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55 or 56. 
Further particularly preferred in this regard are LSG 
polynucleotides encoding polypeptide variants, analogs, 
25 derivatives and fragments, and variants, analogs and 

derivatives of the fragments, in which several, a few, 5 to 
10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are 
substituted, deleted or added, in any combination. 
Especially preferred among these are silent substitutions, 
30 additions and deletions, which do not alter the properties 
and activities of the LSG. Also especially preferred in 
this regard are conservative substitutions. Most highly 
preferred are polynucleotides encoding polypeptides having 
the amino acid sequences as polypeptides encoded by SEQ ID 
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NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, 22, 37 or 38 without substitutions. 

Further preferred embodiments of the invention are 
LSG polynucleotides that are at least 70% identical to a 
5 polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 
and polynucleotides which are complementary to such 
polynucleotides. More preferred are LSG polynucleotides 
that comprise a region that is at least 80% identical to a 

10 polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
In this regard, LSG polynucleotides at least 90% identical 
to the same are particularly preferred, and among these 
particularly preferred LSG polynucleotides, those with at 

15 least 95% are especially preferred. Furthermore, those 

with at least 97% are highly preferred among those with at 
least 95%, and among these those with at least 98% and at 
least 99% are particularly highly preferred, with at least 
99% being the most preferred. 

20 Particularly preferred embodiments in this respect, 

moreover, are polynucleotides which encode polypeptides 
which retain substantially the same biological function or 
activity as the mature polypeptides encoded by a 
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 

25 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
The present invention further relates to 
polynucleotides that hybridize to the herein above- 
described LSG sequences. In this regard, the present 
invention especially relates to polynucleotides which 

3 0 hybridize under stringent conditions to the herein above- 
described polynucleotides. As herein used, the term 
"stringent conditions" means hybridization will occur only 
if there is at least 95% and preferably at least 97% 
identity between the sequences. 
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As discussed additionally herein regarding 
polynucleotide assays of the invention, for instance, 
polynucleotides of the invention as described herein, may 
be used as a hybridization probe for cDNA and genomic DNA 
5 to isolate full-length cDNAs and genomic clones encoding 
LSGs and to isolate cDNA and genomic clones of other genes 
that have a high sequence similarity to these LSGs. Such 
probes generally will comprise at least 15 bases. 
Preferably, such probes will have at least 30 bases and may 
10 have at least 50 bases. 

For example, the coding region of LSG of the present 
invention may be isolated by screening using an 
oligonucleotide probe synthesized from the known DNA 
sequence. A labeled oligonucleotide having a sequence 
15 complementary to that of a gene of the present invention is 
used to screen a library of human cDNA, genomic DNA or mRNA 
to determine which members of the library the probe 
hybridizes with. 

The polynucleotides and polypeptides of the present 

2 0 invention may be employed as research reagents and 

materials for discovery of treatments and diagnostics to 
human disease, as further discussed herein relating to 
polynucleotide assays, inter alia. 

The polynucleotides may encode a polypeptide which is 

25 the mature protein plus additional amino or carboxyl- 

terminal amino acids, or amino acids interior to the mature 
polypeptide (when the mature form has more than one 
polypeptide chain, for instance) . Such sequences may play a 
role in processing of a protein from precursor to a mature 

30 form, may facilitate/protein trafficking, may prolong or 

shorten protein half-life or may facilitate manipulation of 
a protein for assay or production, among other things. As 
generally is the case in situ, the additional amino acids 
may be processed away from the mature protein by cellular 

3 5 enzymes. 
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A precursor protein having the mature form of the 
polypeptide fused to one or more prosequences may be an 
inactive form of the polypeptide. When prosequences are 
removed, such inactive precursors generally are activated. 
5 Some or all of the prosequences may be removed before 
activation. Generally, such precursors are called 
proproteins . 

In sum, a polynucleotide of the present invention may 
encode a mature protein, a mature protein plus a leader 

10 sequence (which may be referred to as a preprotein) , a 
precursor of a mature protein having one or more 
prosequences which are not the leader sequences of a 
preprotein, or a preproprotein, which is a precursor to a 
proprotein, having a leader sequence and one or more 

15 prosequences, which generally are removed during processing 
steps that produce active and mature forms of the 
polypeptide . 
Polypeptides 

The present invention further relates to LSG 

20 polypeptides, preferably polypeptides encoded by a 

polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
Exemplary polypeptides are depicted in SEQ ID NO: 39, 40, 
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55 

25 or 56. The invention also relates to fragments, analogs 
and derivatives of these polypeptides. The terms 
"fragment," "derivative" and "analog" when referring to the 
polypeptides of the present invention means a polypeptide 
which retains essentially the same biological function or 

30 activity as such polypeptides. Thus, an analog includes a 
proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a 
recombinant polypeptide, a natural polypeptide or a 
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synthetic polypeptide. In certain preferred embodiments it 
is a recombinant polypeptide. 

The fragment, derivative or analog of a polypeptide 
of or the present invention may be (I) one in which one or 
5 more of the amino acid residues are substituted with a 

conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino 
acid residue may or may not be one encoded by the genetic 
code; (ii) one in which one or more of the amino acid 

10 residues includes a substituent group; (iii) one in which 
the mature polypeptide is fused with another compound, such 
as a compound to increase the half-life of the polypeptide 
(for example, polyethylene glycol) ; or (iv) one in which 
the additional amino acids are fused to the mature 

15 polypeptide, such as a leader or secretory sequence or a 
sequence which is employed for purification of the mature 
polypeptide or a proprotein sequence. Such fragments, 
derivatives and analogs are deemed to be within the scope 
of those skilled in the art from the teachings herein. 

2 0 Among preferred variants are those that vary from a 

reference by conservative amino acid substitutions. Such 
substitutions are those that substitute a given amino acid 
in a polypeptide by another amino acid of like 
characteristics. Typically seen as conservative 
25 substitutions are the replacements, one for another, among 
the aliphatic amino acids Ala, Val, Leu and lie; 
interchange of the hydroxyl residues Ser and Thr, exchange 
of the acidic residues Asp and Glu, substitution between 
the amide residues Asn and Gin, exchange of the basic 

3 0 residues Lys and .Arg and replacements among the aromatic 

residues Phe, Tyr . 

The polypeptides and polynucleotides of the present 
invention are preferably provided in an isolated form, and 
preferably are purified to homogeneity. 



Mo? 
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The polypeptides of the present invention include the 
polypeptides encoded by the polynucleotide of SEQ ID NO: 1, 
2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 37 or 38 (in particular the mature 
5 polypeptide) as well as polypeptides which have at least 
75% similarity (preferably at least 75% identity) , more 
preferably at least 90% similarity (more preferably at 
least 90% identity) , still more preferably at least 95% 
similarity (still more preferably at least 95% identity), 
10 to a polypeptide encoded by SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 
37 or 38. Also included are portions of such polypeptides 
generally containing at least 30 amino acids and more 
preferably at least 50 amino acids. Exemplary polypeptides 
5 15 are depicted in SEQ ID NO:39, 40, 41, 42, 43, 44, 45, 46, 

J 47, 48, 49, 50, 51, 52, 53, 54, 55 or 56. 

As known in the art "similarity" between two 
Q polypeptides is determined by comparing the amino acid 

^ sequence and its conserved amino acid substitutes of one 

□ 20 polypeptide sequence with that of a second polypeptide. 

J 3 Fragments or portions of the polypeptides of the 

present invention may be employed for producing the 
corresponding full-length polypeptide by peptide synthesis; 
therefore, the fragments may be employed as intermediates 
25 for producing the full-length polypeptides. Fragments or 
portions of the polynucleotides of the present invention 
may be used to synthesize full-length polynucleotides of 
the present invention. 
Fragments 

3 0 Also among preferred embodiments of this aspect of 

the present invention are polypeptides comprising fragments 
of a polypeptide encoded by a polynucleotide of SEQ ID NO: 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 37 or 38. In this regard a fragment is 

3 5 a polypeptide having an amino acid sequence that entirely 
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is the same as part but not all of the amino acid sequence 
of the aforementioned LSG polypeptides and variants or 
derivatives thereof . 

Such fragments may be " free-standing, " i.e., not part 
5 of or fused to other amino acids or polypeptides, or they 
may be contained within a larger polypeptide of which they 
form a part or region. When contained within a larger 
polypeptide, the presently discussed fragments most 
preferably form a single continuous region. However, 
10 several fragments may be comprised within a single larger 
polypeptide. For instance, certain preferred embodiments 
O relate to a fragment of a LSG polypeptide of the present 

J3 comprised within a precursor polypeptide designed for 

O expression in a host and having heterologous pre- and pro- 

|S 15 polypeptide regions fused to the amino terminus of the LSG 

If! fragment and an additional region fused to the carboxyl 

terminus of the fragment. Therefore, fragments in one 
ti aspect of the meaning intended herein, refers to the 

fy portion or portions of a fusion polypeptide or fusion 

S 20 protein derived from a LSG polypeptide. 

H As representative examples of polypeptide fragments 

of the invention, there may be mentioned those which have 
from about 15 to about 139 amino acids. In this context 
"about" includes the particularly recited range and ranges 
25 larger or smaller by several, a few, 5, 4, 3, 2 or 1 amino 
acid at either extreme or at both extremes. Highly 
preferred in this regard are the recited ranges plus or 
minus as many as 5 amino acids at either or at both 
extremes. Particularly highly preferred are the recited 
3 0 ranges plus or minus as many as 3 amino acids at either or 
at both the recited extremes. Especially preferred are 
ranges plus or minus 1 amino acid at either or at both 
extremes or the recited ranges with no additions or 
deletions. Most highly preferred of all in this regard are 
35 fragments from about 15 to about 45 amino acids. 
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Among especially preferred fragments of the invention 
are truncation mutants of the LSG polypeptides. Truncation 
mutants include LSG polypeptides having an amino acid 
sequence encoded by a polynucleotide of SEQ ID NO: 1, 2, 3, 

5 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 37 or 38 or variants or derivatives thereof, 
except for deletion of a continuous series of residues 
(that is, a continuous region, part or portion) that 
includes the amino terminus, or a continuous series of 
10 residues that includes the carboxyl terminus or, as in 
double truncation mutants, deletion of two continuous 
series of residues, one including the amino terminus and 
one including the carboxyl terminus. Fragments having the 
size ranges set out herein also are preferred embodiments 

15 of truncation fragments, which are especially preferred 
among fragments generally . 

Also preferred in this aspect of the invention are 
fragments characterized by structural or functional 
attributes of the LSG polypeptides of the present 

20 invention. Preferred embodiments of the invention in this 
regard include fragments that comprise alpha-helix and 
alpha-helix forming regions ("alpha-regions"), beta-sheet 
and beta- sheet -forming regions ("beta-regions"), turn and 
turn-forming regions ("turn-regions"), coil and coil- 

25 forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surf ace- forming 
regions and high antigenic index regions of the LSG 
polypeptides of the present invention. Regions of the 

30 aforementioned types are identified routinely by analysis 
of the amino acid sequences encoded by the polynucleotides 
Of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. Preferred 
regions include Garnier-Robson alpha-regions, beta-regions, 

35 turn-regions and coil -regions , Chou-Fasman alpha- regions , 
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beta-regions and turn-regions, Kyte-Doolittle hydrophilic 
regions and hydrophilic regions, Eisenberg alpha and beta 
amphipathic regions, Karplus-Schulz flexible regions, Emini 
surface-forming regions and Jameson-Wolf high antigenic 
5 index regions. Among highly preferred fragments in this 

regard are those that comprise regions of LSGs that combine 
several structural features, such as several of the 
features set out above. In this regard, the regions 
defined by selected residues of a LSG polypeptide which all 

10 are characterized by amino acid compositions highly 
characteristic of turn-regions, hydrophilic regions, 
flexible-regions, surf ace- forming regions, and high 
antigenic index-regions, are especially highly preferred 
regions. Such regions may be comprised within a larger 

15 polypeptide or may be by themselves a preferred fragment of 
the present invention, as discussed above. It will be 
appreciated that the term "about" as used in this paragraph 
has the meaning set out above regarding fragments in 
general . 

20 Further preferred regions are those that mediate 

activities of LSG polypeptides. Most highly preferred in 
this regard are fragments that have a chemical, biological 
or other activity of a LSG polypeptide, including those 
with a similar activity or an improved activity, or with a 

25 decreased undesirable activity. Highly preferred in this 
regard are fragments that contain regions that are homologs 
in sequence, or in position, or in both sequence and to 
active regions of related polypeptides, and which include 
lung specific-binding proteins. Among particularly 

30 preferred fragments in these regards are truncation 
mutants, as discussed above. 

It will be appreciated that the invention also 
relates to polynucleotides encoding the aforementioned 
fragments, polynucleotides that hybridize to 

35 polynucleotides encoding the fragments, particularly those 
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that hybridize under stringent conditions, and 
polynucleotides such as PCR primers for amplifying 
polynucleotides that encode the fragments. In these 
regards, preferred polynucleotides are those that 
5 correspond to the preferred fragments, as discussed above. 
Fusion Proteins 

In one embodiment of the present invention, the LSG 
polypeptides of the present invention are preferably fused 
to other proteins. These fusion proteins can be used for a 
10 variety of applications. For example, fusion of the present 
polypeptides to His-tag, HA- tag, protein A, IgG domains, 
and maltose binding protein facilitates purification. (See 
also EP A 394,827; Traunecker, et al . , Nature 331: 84-86 
(1988)) Similarly, fusion to IgG-1, IgG-3, and albumin 
15 increases the halflife time in vivo. Nuclear localization 
signals fused to the polypeptides of the present invention 
can target the protein to a specific subcellular 
localization, while covalent heterodimer or homodimers can 
increase or decrease the activity of a fusion protein. 
20 Fusion proteins can also create chimeric molecules having 
more than one function. Finally, fusion proteins can 
increase solubility and/or stability of the fused protein 
compared to the non- fused protein. All of these types of 
fusion proteins described above can be made in accordance 
25 with well known protocols. 

For example, a LSG polypeptide can be fused to an IgG 
molecule via the following protocol. Briefly, the human Fc 
portion of the IgG molecule is PCR amplified using primers 
that span the 5 ' and 3 ' ends of the sequence . These 
3 0 primers also have convenient restriction enzyme sites that 
facilitate cloning into an expression vector, preferably a 
mammalian expression vector. For example, if pC4 
(Accession No. 209646) is used, the human Fc portion can be 
ligated into the BamHI cloning site. In this protocol, the 
35 3' BamHI site must be destroyed. Next, the vector 
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containing the human Fc portion is re-restricted with BamHI 
thereby linearizing the vector, and a LSG polynucleotide of 
the present invention is ligated into this BamHI site. It 
is preferred that the polynucleotide is cloned without a 
5 stop codon, otherwise a fusion protein will not be 
produced . 

If the naturally occurring signal sequence is used to 
produce the secreted protein, pC4 does not need a second 
signal peptide. Alternatively, if the naturally occurring 
10 signal sequence is not used, the vector can be modified to 
include a heterologous signal sequence. (See, e. g., WO 
„ 96/34891.) 
y3 Diagnostic Assays 

The present invention also relates to diagnostic 
15 assays and methods, both quantitative and qualitative for 
Si detecting, diagnosing, monitoring, staging and 

^ prognosticating cancers by comparing levels of LSG in a 

^ human patient with those of LSG in a normal human control. 

Nj For purposes of the present invention, what is meant by LSG 

^ 20 levels is, among other things, native protein expressed by 

■ess? 

P a gene comprising the polynucleotide sequence of SEQ ID NO: 

^ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 

18, 19, 20, 21, 22, 37 or 38. Exemplary polypeptides 
encoded by these polynucleotides are depicted in SEQ ID 
25 NO:39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 
53, 54, 55 or 56. By "LSG" it is also meant herein 
polynucleotides which, due to degeneracy in genetic coding, 
comprise variations in nucleotide sequence as compared to 
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
30 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 but which still 

encode the same protein. The native protein being detected 
may be whole, a breakdown product, a complex of molecules 
or chemically modified. In the alternative, what is meant 
by LSG as used herein, means the native mRNA encoded by a 
35 polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 
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8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 
22, or a contig of SEQ ID NO: 19 or 21, depicted as SEQ ID 
NO: 37 or 38, respectively, levels of the gene comprising 
the polynucleotide sequence of SEQ ID NO : 1, 2, 3, 4, 5, 6, 
5 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 
22, or levels of a polynucleotide which is capable of 
hybridizing under stringent conditions to the antisense 
sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. Such 
10 levels are preferably determined in at least one of cells, 
tissues and/or bodily fluids, including determination of 
normal and abnormal levels. Thus, for instance, a 
J3 diagnostic assay in accordance with the invention for 

diagnosing overexpression of LSG protein compared to normal 
Un 15 control bodily fluids, cells, or tissue samples may be used 

%A to diagnose the presence of lung cancer. 

s All the methods of the present invention may 

*Tj optionally include determining the levels of other cancer 

5 markers as well as LSG. Other cancer markers, in addition 

Pi 20 to LSG, useful in the present invention will depend on the 

H= cancer being tested and are known to those of skill in the 

art . 

The present invention provides methods for diagnosing 
the presence of lung cancer by analyzing for changes in 

25 levels of LSG in cells, tissues or bodily fluids compared 
with levels of LSG in cells, tissues or bodily fluids of 
preferably the same type from a normal human control, 
wherein an increase in levels of LSG in the patient versus 
the normal human control is associated with the presence of 

3 0 lung cancer . 

Without limiting the instant invention, typically, 
for a quantitative diagnostic assay a positive result 
indicating the patient being tested has cancer is one in 
which cells, tissues or bodily fluid levels of the cancer 

3 5 marker, such as LSG, are at least two times higher, and 
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most preferably are at least five times higher, than in 
preferably the same cells, tissues or bodily fluid of a 
normal human control . 

The present invention also provides a method of 
5 diagnosing metastatic lung cancer in a patient having lung 
cancer which has not yet metastasized for the onset of 
metastasis. In the method of the present invention, a 
human cancer patient suspected of having lung cancer which 
may have metastasized (but which was not previously known 
10 to have metastasized) is identified. This is accomplished 
by a variety of means known to those of skill in the art. 

In the present invention, determining the presence of 
LSG levels in cells, tissues or bodily fluid, is 
particularly useful for discriminating between lung cancer 
15 which has not metastasized and lung cancer which has 
metastasized. Existing techniques have difficulty 
discriminating between lung cancer which has metastasized 
and lung cancer which has not metastasized and proper 
treatment selection is often dependent upon such knowledge. 
20 In the present invention, the cancer marker levels 

measured in such cells, tissues or bodily fluid is LSG, and 
are compared with levels of LSG in preferably the same 
cells, tissue or bodily fluid type of a normal human 
control. That is, if the cancer marker being observed is 
25 just LSG in serum, this level is preferably compared with 
the level of LSG in serum of a normal human control . An 
increase in the LSG in the patient versus the normal human 
control is associated with lung cancer which has 
metastasized . 

30 Without limiting the instant invention, typically, 

for a quantitative diagnostic assay a positive result 
indicating the cancer in the patient being tested or 
monitored has metastasized is one in which cells, tissues 
or bodily fluid levels of the cancer marker, such as LSG, 

35 are at least two times higher, and most preferably are at 
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least five times higher, than in preferably the same cells, 
tissues or bodily fluid of a normal patient. 

Normal human control as used herein includes a human 
patient without cancer and/or non cancerous samples from 
5 the patient; in the methods for diagnosing or monitoring 
for metastasis, normal human control may preferably also 
include samples from a human patient that is determined by 
reliable methods to have lung cancer which has not 
metastasized. 
10 Staging 

The invention also provides a method of staging lung 
cancer in a human patient. The method comprises 
identifying a human patient having such cancer and 
analyzing cells, tissues or bodily fluid from such human 

15 patient for LSG. The LSG levels determined in the patient 
are then compared with levels of LSG in preferably the same 
cells, tissues or bodily fluid type of a normal human 
control, wherein an increase in LSG levels in the human 
patient versus the normal human control is associated with 

20 a cancer which is progressing and a decrease in the levels 
of LSG (but still increased over true normal levels) is 
associated with a cancer which is regressing or in 
remission . 
Moni toring 

25 Further provided is a method of monitoring lung 

cancer in a human patient having such cancer for the onset 
of metastasis. The method comprises identifying a human 
patient having such cancer that is not known to have 
metastasized; periodically analyzing cells, tissues or 

3 0 bodily fluid from such human patient for LSG; and comparing 
the LSG levels determined in the human patient with levels 
of LSG in preferably the same cells, tissues or bodily 
fluid type of a normal human control, wherein an increase 
in LSG levels in the human patient versus the normal human 

35 control is associated with a cancer which has metastasized. 
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In this method, normal human control samples may also 

include prior patient samples. 

Further provided by this invention is a method of 

monitoring the change in stage of lung cancer in a human 
5 patient having such cancer. The method comprises 

identifying a human patient having such cancer; 

periodically analyzing cells, tissues or bodily fluid from 

such human patient for LSG; and comparing the LSG levels 

determined in the human patient with levels of LSG in 
10 preferably the same cells, tissues or bodily fluid type of 

a normal human control, wherein an increase in LSG levels 
P in the human patient versus the normal human control is 

ijQ associated with a cancer which is progressxng m stage and 

O a decrease in the levels of LSG is associated with a cancer 

{f\ 15 which is regressing in stage or in remission. In this 

?] method, normal human control samples may also include prior 

"Si 

s patient samples . 

Monitoring a patient for onset of metastasis is 
Rj periodic and preferably done on a quarterly basis. 

% 20 However, this may be done more or less frequently depending 

C on the cancer, the particular patient, and the stage of the 

cancer . 

Prognostic Testing and Clinical Trial Monitoring 

The methods described herein can further be utilized 
25 as prognostic assays to identify subjects having or at risk 
of developing a disease or disorder associated with 
increased levels of LSG. The present invention provides a 
method in which a test sample is obtained from a human 
patient and LSG is detected. The presence of higher LSG 
3 0 i ev els as compared to normal human controls is diagnostic 
for the human patient being at risk for developing cancer, 
particularly lung cancer. 

The effectiveness of therapeutic agents to decrease 
expression or activity of the LSGs of the invention can 
35 also be monitored by analyzing levels of expression of the 
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LSGs in a human patient in clinical trials or in in vitro 
screening assays such as in human cells. In this way, the 
gene expression pattern can serve as a marker, indicative 
of the physiological response of the human patient, or 
5 cells as the case may be, to the agent being tested. 
Detection of genetic lesions or mutations 

The methods of the present invention can also be used 
to detect genetic lesions or mutations in LSG, thereby 
determining if a human with the genetic lesion is at risk 
10 for lung cancer or has lung cancer. Genetic lesions can be 
detected, for example, by ascertaining the existence of a 
deletion and/or addition and/or substitution of one or more 
nucleotides from the LSGs of this invention, a chromosomal 
rearrangement of LSG, aberrant modification of LSG (such as 
15 of the methylation pattern of the genomic DNA) , the 

presence of a non-wild type splicing pattern of a mRNA 
transcript of LSG, allelic loss of LSG, and/or 
inappropriate post- translational modification of LSG 
protein. Methods to detect such lesions in the LSG of this 
20 invention are known to those of skill in the art. 

For example, in one embodiment, alterations in a gene 
corresponding to a LSG polynucleotide of the present 
invention are determined via isolation of RNA from entire 
families or individual patients presenting with a phenotype 
25 of interest (such as a disease) is be isolated. cDNA is 

then generated from these RNA samples using protocols known 
in the art. See, e.g. Sambrook et al . (MOLECULAR CLONING: A 
LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. (1989)), which is 
3 0 illustrative of the many laboratory manuals that detail 

these techniques. The cDNA is then used as a template for 
PCR, employing primers surrounding regions of interest in 
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. PCR conditions 
35 typically consist of 35 cycles at 95°C for 30 seconds; 60- 
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120 seconds at 52-58°C; and 60-120 seconds at 70°C, using 
buffer solutions described in Sidransky, D., et al . , 
Science 252: 706 (1991). PCR products are sequenced using 
primers labeled at their 5' end with T4 polynucleotide 
5 kinase, employing SequiTherm Polymerase (Epicentre 

Technologies) . The intron-exon borders of selected exons 
are also determined and genomic PCR products analyzed to 
confirm the results. PCR products harboring suspected 
mutations are then cloned and sequenced to validate the 
10 results of the direct sequencing. PCR products are cloned 
into T-tailed vectors as described in Holton, T. A. and 

Q Graham, M. W. , Nucleic Acids Research, 19 : 1156 (1991) and 

5 sequenced with T7 polymerase (United States Biochemical) . 

^ Affected individuals are identified by mutations not 

Ul 15 present in unaffected individuals. 

^\ Genomic rearrangements can also be observed as a 

«" method of determining alterations in a gene corresponding 

jfj to a polynucleotide. In this method, genomic clones are 

flU nick-translated with digoxigenin deoxy-uridine 

20 5 1 triphosphate (Boehringer Manheim) , and FISH is performed 
M» as described in Johnson, C. et al . , Methods Cell Biol. 35: 

73-99 (1991) . Hybridization with a labeled probe is carried 
out using a vast excess of human DNA for specific 
hybridization to the corresponding genomic locus. 
25 Chromosomes are counterstained with 4 , 6-diamino-2- 

phenylidole and propidium iodide, producing a combination 
of C-and R-bands . Aligned images for precise mapping are 
obtained using a triple-band filter set (Chroma Technology, 
Brattleboro, VT) in combination with a cooled charge- 
30 coupled device camera (Photometries, Tucson, AZ) and 

variable excitation wavelength filters (Johnson et al . , 
Genet. Anal. Tech. Appl . , 8: 75 (1991)). Image collection, 
analysis and chromosomal fractional length measurements are 
performed using the ISee Graphical Program System 
3 5 (Inovision Corporation, Durham, NC) . Chromosome 
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alterations of the genomic region hybridized by the probe 
are identified as insertions, deletions, and 

translocations. These alterations are used as a diagnostic 
marker for an associated disease. 
5 Assay Techniques 

Assay techniques that can be used to determine levels 
of gene expression (including protein levels) , such as LSG 
of the present invention, in a sample derived from a 
patient are well known to those of skill in the art. Such 
10 assay methods include, without limitation, 

radioimmunoassays, reverse transcriptase PCR (RT-PCR) 
assays, immunohistochemistry assays, in situ hybridization 
assays, competitive-binding assays, Western Blot analyses, 
ELISA assays and proteomic approaches: two-dimensional gel 
15 electrophoresis (2D electrophoresis) and non-gel based 

approaches such as mass spectrometry or protein interaction 
profiling. Among these, ELISAs are frequently preferred to 
diagnose a gene's expressed protein in biological fluids. 
An ELISA assay initially comprises preparing an 
20 antibody, if not readily available from a commercial 

source, specific to LSG, preferably a monoclonal antibody. 
In addition a reporter antibody generally is prepared which 
binds specifically to LSG. The reporter antibody is 
attached to a detectable reagent such as radioactive, 
25 fluorescent or enzymatic reagent, for example horseradish 
peroxidase enzyme or alkaline phosphatase. 

To carry out the ELISA, antibody specific to LSG is 
incubated on a solid support, e.g. a polystyrene dish, that 
binds the antibody. Any free protein binding sites on the 
30 dish are then covered by incubating with a non-specific 

protein such as bovine serum albumin. Next, the sample to 
be analyzed is incubated in the dish, during which time LSG 
binds to the specific antibody attached to the polystyrene 
dish. Unbound sample is washed out with buffer. A reporter 
35 antibody specifically directed to LSG and linked to a 
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detectable reagent such as horseradish peroxidase is placed 
in the dish resulting in binding of the reporter antibody 
to any monoclonal antibody bound to LSG. Unattached 
reporter antibody is then washed out. Reagents for 
5 peroxidase activity, including a colorimetric substrate are 
then added to the dish. Immobilized peroxidase, linked to 
LSG antibodies, produces a colored reaction product. The 
amount of color developed in a given time period is 
proportional to the amount of LSG protein present in the 
10 sample. Quantitative results typically are obtained by 
reference to a standard curve . 

A competition assay can also be employed wherein 
antibodies specific to LSG are attached to a solid support 
and labeled LSG and a sample derived from the host are 
15 passed over the solid support. The amount of label 

detected which is attached to the solid support can be 
correlated to a quantity of LSG in the sample. 

Using all or a portion of a nucleic acid sequence of 
LSG of the present invention as a hybridization probe, 
20 nucleic acid methods can also be used to detect LSG mRNA as 
a marker for lung cancer. Polymerase chain reaction (PCR) 
and other nucleic acid methods, such as ligase chain 
reaction (LCR) and nucleic acid sequence based 
amplification (NASBA) , can be used to detect malignant 
25 cells for diagnosis and monitoring of various malignancies. 
For example, reverse- transcriptase PCR (RT-PCR) is a 
powerful technique which can be used to detect the presence 
of a specific mRNA population in a complex mixture of 
thousands of other mRNA species. In RT-PCR, an mRNA 
3 0 species is first reverse transcribed to complementary DNA 
(cDNA) with use of the enzyme reverse transcriptase; the 
cDNA is then amplified as in a standard PCR reaction. RT- 
PCR can thus reveal by amplification the presence of a 
single species of mRNA. Accordingly, if the mRNA is highly 



4j 



DEX-0214 - 41 - PATENT 

specific for the cell that produces it, RT-PCR can be used 
to identify the presence of a specific type of cell. 

Hybridization to clones or oligonucleotides arrayed 
on a solid support (i.e. gridding) can be used to both 
5 detect the expression of and quantitate the level of 

expression of that gene. In this approach, a cDNA encoding 
the LSG gene is fixed to a substrate. The substrate may be 
of any suitable type including but not limited to glass, 
nitrocellulose, nylon or plastic. At least a portion of 
10 the DNA encoding the LSG gene is attached to the substrate 
and then incubated with the analyte, which may be RNA or a 
complementary DNA (cDNA) copy of the RNA, isolated from the 
tissue of interest. Hybridization between the substrate 
bound DNA and the analyte can be detected and quantitated 
15 by several means including but not limited to radioactive 
labeling or fluorescence labeling of the analyte or a 
secondary molecule designed to detect the hybrid. 
Quantitation of the level of gene expression can be done by 
comparison of the intensity of the signal from the analyte 
20 compared with that determined from known standards. The 
standards can be obtained by in vitro transcription of the 
target gene, quantitating the yield, and then using that 
material to generate a standard curve. 

Of the proteomic approaches, 2D electrophoresis is a 
25 technique well known to those in the art. Isolation of 
individual proteins from a sample such as serum is 
accomplished using sequential separation of proteins by 
different characteristics usually on polyacrylamide gels. 
First, proteins are separated by size using an electric 
3 0 current. The current acts uniformly on all proteins, so 
smaller proteins move farther on the gel than larger 
proteins. The second dimension applies a current 
perpendicular to the first and separates proteins not on 
the basis of size but on the specific electric charge 
35 carried by each protein. Since no two proteins with 
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different sequences are identical on the basis of both size 
and charge, the result of a 2D separation is a square gel 
in which each protein occupies a unique spot. Analysis of 
the spots with chemical or antibody probes, or subsequent 
5 protein microsequencing can reveal the relative abundance 
of a given protein and the identity of the proteins in the 
sample . 

The above tests can be carried out on samples derived 
from a variety of cells, bodily fluids and/or tissue 

10 extracts such as homogenates or solubilized tissue obtained 
from a patient. Tissue extracts are obtained routinely 
from tissue biopsy and autopsy material. Bodily fluids 
useful in the present invention include blood, urine, 
saliva or any other bodily secretion or derivative thereof. 

15 By blood it is meant to include whole blood, plasma, serum 
or any derivative of blood. 

In Vivo Targeting of LSG/Lung Cancer Therapy 

Identification of this LSG is also useful in the 
rational design of new therapeutics for imaging and 

20 treating cancers, and in particular lung cancer. For 

example, in one embodiment, antibodies which specifically 
bind to LSG can be raised and used in vivo in patients 
suspected of suffering from lun/cancer. Antibodies which 
specifically bind LSG can be injected into a patient 

25 suspected of having lung cancer for diagnostic and/or 

therapeutic purposes. Thus, another aspect of the present 
invention provides for a method for preventing the onset 
and treatment of lung cancer in a human patient in need of 
such treatment by administering to the patient an effective 

3 0 amount of antibody. By "effective amount" it is meant the 
amount or concentration of antibody needed to bind to the 
target antigens expressed on the tumor to cause tumor 
shrinkage for surgical removal, or disappearance of the 
tumor. The binding of the antibody to the overexpressed 

35 LSG is believed to cause the death of the cancer cell 
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expressing such LSG. The preparation and use of antibodies 
for in vivo diagnosis and treatment is well known in the 
art. For example, ant ibody-chelators labeled with Indium- 
Ill have been described for use in the 
5 radioimmunoscintographic imaging of carcinoembryonic 

antigen expressing tumors (Sumerdon et al . Nucl . Med. Biol. 

1990 17:247-254). In particular, these antibody-chelators 
have been used in detecting tumors in patients suspected of 
having recurrent colorectal cancer (Griffin et al . J. Clin. 

10 One. 1991 9:631-640). Antibodies with paramagnetic ions as 
labels for use in magnetic resonance imaging have also been 
described (Lauffer, R.B. Magnetic Resonance in Medicine 

1991 22:339-342) . Antibodies directed against LSG can be 
used in a similar manner. Labeled antibodies which 

15 specifically bind LSG can be injected into patients 
suspected of having lung cancer for the purpose of 
diagnosing or staging of the disease status of the patient. 
The label used will be selected in accordance with the 
imaging modality to be used. For example, radioactive 

20 labels such as Indium-Ill, Technetium- 99m or Iodine-131 can 
be used for planar scans or single photon emission computed 
tomography (SPECT) . Positron emitting labels such as 
Fluorine-19 can be used in positron emission tomography. 
Paramagnetic ions such as Gadlinium (III) or Manganese (II) 

2 5 can be used in magnetic resonance imaging (MRI) . Presence 

of the label, as compared to imaging of normal tissue, 
permits determination of the spread of the cancer. The 
amount of label within an organ or tissue also allows 
determination of the presence or absence of cancer in that 

3 0 organ or tissue. 

Antibodies which can be used in in vivo methods 
include polyclonal, monoclonal and omniclonal antibodies 
and antibodies prepared via molecular biology techniques. 
Antibody fragments and aptamers and single- stranded 
3 5 oligonucleotides such as those derived from an in vitro 
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evolution protocol referred to as SELEX and well known to 
those skilled in the art can also be used. 
Screening Assays 

The present invention also provides methods for 
5 identifying modulators which bind to LSG protein or have a 
modulatory effect on the expression or activity of LSG 
protein. Modulators which decrease the expression or 
activity of LSG protein are believed to be useful in 
treating lung cancer. Such screening assays are known to 
10 those of skill in the art and include, without limitation, 
cell-based assays and cell free assays. 

Small molecules predicted via computer imaging to 
specifically bind to regions of LSG can also be designed, 
synthesized and tested for use in the imaging and treatment 
15 of lung cancer. Further, libraries of molecules can be 
screened for potential anticancer agents by assessing the 
ability of the molecule to bind to the LSGs identified 
herein. Molecules identified in the library as being 
capable of binding to LSG are key candidates for further 
20 evaluation for use in the treatment of lung cancer. In a 
preferred embodiment, these molecules will downregulate 
expression and/or activity of LSG in cells. 
Adoptive Immunotherapy and Vaccines 

Adoptive immunotherapy of cancer refers to a 
25 therapeutic approach in which immune cells with an 

antitumor reactivity are administered to a tumor-bearing 
host, with the aim that the cells mediate either directly 
or indirectly, the regression of an established tumor. 
Transfusion of lymphocytes, particularly T lymphocytes, 
30 falls into this category and investigators at the National 
Cancer Institute (NCI) have used autologous reinfusion of 
peripheral blood lymphocytes or tumor- infiltrating 
lymphocytes (TIL) , T cell cultures from biopsies of 
subcutaneous lymph nodules, to treat several human cancers 
35 (Rosenberg, S. A., U.S. Patent No. 4,690,914, issued Sep. 
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1, 1987; Rosenberg, S. A., et al . , 1988, N. England J. Med. 
319 : 1676-1680) . 

The present invention relates to compositions and 
methods of adoptive immunotherapy for the prevention and/or 
treatment of primary and metastatic lung cancer in humans 
using macrophages sensitized to the antigenic LSG 
molecules, with or without non-covalent complexes of heat 
shock protein (hsp) . Antigenicity or immunogenicity of the 
LSG is readily confirmed by the ability of the LSG protein 
or a fragment thereof to raise antibodies or educate naive 
effector cells, which in turn lyse target cells expressing 
the antigen (or epitope) . 

Cancer cells are, by definition, abnormal and contain 
proteins which should be recognized by the immune system as 
foreign since they are not present in normal tissues. 
However, the immune system often seems to ignore this 
abnormality and fails to attack tumors. The foreign LSG 
proteins that are produced by the cancer cells can be used 
to reveal their presence. The LSG is broken into short 
fragments, called tumor antigens, which are displayed on 
the surface of the cell. These tumor antigens are held or 
presented on the cell surface by molecules called MHC, of 
which there are two types: class I and II. Tumor antigens 
in association with MHC class I molecules are recognized by 
cytotoxic T cells while antigen-MHC class II complexes are 
recognized by a second subset of T cells called helper 
cells. These cells secrete cytokines which slow or stop 
tumor growth and help another type of white blood cell, B 
cells, to make antibodies against the tumor cells. 

In adoptive immunotherapy, T cells or other antigen 
presenting cells (APCs) are stimulated outside the body (ex 
vivo) , using the tumor specific LSG antigen. The 
stimulated cells are then reinfused into the patient where 
they attack the cancerous cells. Research has shown that 
using both cytotoxic and helper T cells is far more 
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effective than using either subset alone. Additionally, 
the LSG antigen may be complexed with heat shock proteins 
to stimulate the APCs as described in U.S. Patent No. 
5, 985, 270 . 

5 The APCs can be selected from among those antigen 

presenting cells known in the art including, but not 
limited to, macrophages, dendritic cells, B lymphocytes, 
and a combination thereof, and are preferably macrophages. 
In a preferred use, wherein cells are autologous to the 
10 individual, autologous immune cells such as lymphocytes, 
macrophages or other APCs are used to circumvent the issue 
of whom to select as the donor of the immune cells for 
adoptive transfer. Another problem circumvented by use of 
J2 autologous immune cells is graft versus host disease which 

15 can be fatal if unsuccessfully treated. 

In adoptive immunotherapy with gene therapy, DNA of 
the LSG can be introduced into effector cells similarly as 
in conventional gene therapy. This can enhance the 
cytotoxicity of the effector cells to tumor cells as they 
20 have been manipulated to produce the antigenic protein 
resulting in improvement of the adoptive immunotherapy. 

LSG antigens of this invention are also useful as 
components of lung cancer vaccines. The vaccine comprises 
an immunogenically stimulatory amount of a LSG antigen. 
25 Immunogenically stimulatory amount refers to that amount of 
antigen that is able to invoke the desired immune response 
in the recipient for the amelioration, or treatment of lung 
cancer. Effective amounts may be determined empirically by 
standard procedures well known to those skilled in the art. 
3 0 The LSG antigen may be provided in any one of a 

number of vaccine formulations which are designed to induce 
the desired type of immune response, e.g., antibody and/or 
cell mediated. Such formulations are known in the art and 
include, but are not limited to, formulations such as those 
35 described in U.S. Patent 5,585,103. Vaccine formulations 
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of the present invention used to stimulate immune responses 
can also include pharmaceutical^ acceptable adjuvants. 
Vectors, host cells, expression 

The present invention also relates to vectors which 
5 include polynucleotides of the present invention, host 

cells which are genetically engineered with vectors of the 
invention and the production of polypeptides of the 
invention by recombinant techniques. 

Host cells can be genetically engineered to 
10 incorporate LSG polynucleotides and express LSG 

polypeptides of the present invention. For instance, LSG 
hi polynucleotides may be introduced into host cells using 

well known techniques of infection, transduction, 
2 transf ection, transvection and transformation. The LSG 

W 15 polynucleotides may be introduced alone or with other 

£j polynucleotides. Such other polynucleotides may be 

^ introduced independently, co- introduced or introduced 

"1 joined to the LSG polynucleotides of the invention. 

[| For example, LSG polynucleotides of the invention may 

5 20 be transf ected into host cells with another, separate, 

M= polynucleotide encoding a selectable marker, using standard 

techniques for co- transf ection and selection in, for 
instance, mammalian cells. In this case, the 
polynucleotides generally will be stably incorporated into 
25 the host cell genome. 

Alternatively, the LSG polynucleotide may be joined 
to a vector containing a selectable marker for propagation 
in a host. The vector construct may be introduced into 
host cells by the aforementioned techniques. Generally, a 
3 0 plasmid vector is introduced as DNA in a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a 
charged lipid. Electroporation also may be used to 
introduce LSG polynucleotides into a host. If the vector 
is a virus, it may be packaged in vitro or introduced into 
35 a packaging cell and the packaged virus may be transduced 



DEX-0214 - 48 - PATENT 

into cells. A wide variety of well known techniques 
conducted routinely by those of skill in the art are 
suitable for making LSG polynucleotides and for introducing 
LSG polynucleotides into cells in accordance with this 
5 aspect of the invention. Such techniques are reviewed at 
length in reference texts such as Sambrook et al . , 
previously cited herein. 

Vectors which may be used in the present invention 
include, for example, plasmid vectors, single- or double- 
10 stranded phage vectors, and single- or double- stranded RNA 
or DNA viral vectors. Such vectors may be introduced into 
cells as polynucleotides, preferably DNA, by well known 
y3 techniques for introducing DNA and RNA into cells. The 

JSS. 

vectors, in the case of phage and viral vectors, also may 
?B 15 be and preferably are introduced into cells as packaged or 

*f* encapsidated virus by well known techniques for infection 

^ and transduction. Viral vectors may be replication 

q competent or replication defective. In the latter case 

y_ viral propagation generally will occur only in 

q 20 complementing host cells. 

C3 Preferred vectors for expression of polynucleotides 

and polypeptides of the present invention include, but are 
not limited to, vectors comprising cis-acting control 
regions effective for expression in a host operatively 

25 linked to the polynucleotide to be expressed. Appropriate 
trans-acting factors either are supplied by the host, 
supplied by a complementing vector or supplied by the 
vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the 

30 vectors provide for specific expression. Such specific 

expression may be inducible expression or expression only 
in certain types of cells or both inducible and cell- 
specific. Particularly preferred among inducible vectors 
are vectors that can be induced to express by environmental 

3 5 factors that are easy to manipulate, such as temperature 
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and nutrient additives. A variety of vectors suitable to 
this aspect of the invention, including constitutive and 
inducible expression vectors for use in prokaryotic and 
eukaryotic hosts, are well known and employed routinely by 

5 those of skill in the art. 

The engineered host cells can be cultured in 
conventional nutrient media which may be modified as 
appropriate for, inter alia, activating promoters, 
selecting transf ormants or amplifying genes. Culture 
10 conditions such as temperature, pH and the like, previously 
used with the host cell selected for expression, generally 
will be suitable for expression of LSG polypeptides of the 
present invention . 

A great variety of expression vectors can be used to 

15 express LSG polypeptides of the invention. Such vectors 
include chromosomal, episomal and virus-derived vectors. 
Vectors may be derived from bacterial plasmids, from 
bacteriophage, from yeast epi somes, from yeast chromosomal 
elements, from viruses such as baculoviruses , papova 

20 viruses, such as SV40, vaccinia viruses, adenoviruses, fowl 
pox viruses, pseudorabies viruses and retroviruses, and 
from combinations thereof such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids 
and phagemids. All may be used for expression in accordance 

25 with this aspect of the present invention. Generally, any 
vector suitable to maintain, propagate or express 
polynucleotides to express a polypeptide in a host may be 
used for expression in this regard. 

The appropriate DNA sequence may be inserted into the 

3 0 vector by any of a variety of well-known and routine 

techniques. In general, a DNA sequence for expression is 
joined to an expression vector by cleaving the DNA sequence 
and the expression vector with one or more restriction 
endonucleases and then joining the restriction fragments 

35 together using T4 DNA ligase. Procedures for restriction 
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and ligation that can be used to this end are well known 
and routine to those of skill. Suitable procedures in this 
regard, and for constructing expression vectors using 
alternative techniques, which also are well known and 
routine to those skill, are set forth in great detail in 
Sambrook et al . cited elsewhere herein. 

The DNA sequence in the expression vector is 
operatively linked to appropriate expression control 
sequence (s), including, for instance, a promoter to direct 
mRNA transcription. Representative promoters include the 
phage lambda PL promoter, the E . coli lac, trp and tac 
promoters, the SV40 early and late promoters, and promoters 
of retroviral LTRs , to name just a few of the well-known 
promoters. It will be understood that numerous promoters 
not mentioned are also suitable for use in this aspect of 
the invention and are well known and readily may be 
employed by those of skill in the manner illustrated by the 
discussion and the examples herein. 

In general, expression constructs will contain sites 
for transcription initiation and termination, and, in the 
transcribed region, a ribosome binding site for 
translation. The coding portion of the mature transcripts 
expressed by the constructs will include a translation 
initiating AUG at the beginning and a termination codon 
appropriately positioned at the end of the polypeptide to 
be translated. 

In addition, the constructs may contain control 
regions that regulate as well as engender expression. 
Generally, in accordance with many commonly practiced 
procedures, such regions will operate by controlling 
transcription, such as repressor binding sites and 
enhancers, among others. 

Vectors for propagation and expression generally will 
include selectable markers. Such markers also may be 
suitable for amplification or the vectors may contain 
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additional markers for this purpose. In this regard, the 
expression vectors preferably contain one or more 
selectable marker genes to provide a phenotypic trait for 
selection of transformed host cells. Preferred markers 
include dihydrof olate reductase or neomycin resistance for 
eukaryotic cell culture, and tetracycline or ampicillin 
resistance genes for culturing in E. coli and other 
bacteria . 

The vector containing the appropriate DNA sequence as 
described elsewhere herein, as well as an appropriate 
promoter, and other appropriate control sequences, may be 
introduced into an appropriate host using a variety of well 
known techniques suitable to expression therein of a 
desired polypeptide. Representative examples of 
appropriate hosts include bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal 
cells, such as yeast cells; insect cells such as Drosophila 
S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Hosts for a 
great variety of expression constructs are well known, and 
those of skill will be enabled by the present disclosure 
readily to select a host for expressing a LSG polypeptide 
in accordance with this aspect of the present invention. 

More particularly, the present invention also 
includes recombinant constructs, such as expression 
constructs, comprising one or more of the sequences 
described above. The constructs comprise a vector, such as 
a plasmid or viral vector, into which such LSG sequence of 
the invention has been inserted. The sequence may be 
inserted in a forward or reverse orientation. In certain 
preferred embodiments in this regard, the construct further 
comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of 
suitable vectors and promoters are known to those of skill 
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in the art, and there are many commercially available 
vectors suitable for use in the present invention. 

The following vectors, which are commercially 
available, are provided by way of example. Among vectors 
5 preferred for use in bacteria are pQE70, pQE60 and pQE-9, 
available from Qiagen; pBS vectors, Phagescript vectors, 
Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, 
available from Stratagene; and ptrc99a, pKK223-3, pKK233-3, 
pDR540, pRIT5 available from Pharmacia. Among preferred 
10 eukaryotic vectors are PWLNEO, pSV2CAT, pOG44, pXTl and pSG 
available from Stratagene; and pSVK3 , pBPV, pMSG and pSVL 
_ available from Pharmacia. These vectors are listed solely 

J3 by way of illustration of the many commercially available 

^ and well known vectors that are available to those of skill 

y3 15 in the art for use in accordance with this aspect of the 

m 

m present invention. It will be appreciated by those of 

• 

~-4 skill in the art upon reading this disclosure that any 

g other plasmid or vector suitable for introduction, 

y. maintenance, propagation and/or expression of a LSG 

f5 2 0 polynucleotide or polypeptide of the invention in a host 

O may be used in this aspect of the invention. 

Promoter regions can be selected from any desired 
gene using vectors that contain a reporter transcription 
unit lacking a promoter region, such as a chloramphenicol 
25 acetyl transferase ("cat") transcription unit, downstream 
of a restriction site or sites for introducing a candidate 
promoter fragment; i.e., a fragment that may contain a 
promoter. As is well known, introduction into the vector 
of a promoter-containing fragment at the restriction site 
3 0 upstream of the cat gene engenders production of CAT 
activity detectable by standard CAT assays. Vectors 
suitable to this end are well known and readily available. 
Two such vectors are pKK232-8 and pCM7 . Thus, promoters 
for expression of LSG polynucleotides of the present 
35 invention include, not only well known and readily 
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available promoters, but also promoters that readily may be 
obtained by the foregoing technique, using a reporter gene. 

Among known bacterial promoters suitable for 
expression of polynucleotides and polypeptides in 
5 accordance with the present invention are the E. coli laci 
and lacZ promoters, the T3 and T7 promoters, the gpt 
promoter, the lambda PR, PL promoters and the trp promoter. 
Among known eukaryotic promoters suitable in this regard 
are the CMV immediate early promoter, the HSV thymidine 
10 kinase promoter, the early and late SV4 0 promoters, the 
promoters of retroviral LTRs, such as those of the Rous 
sarcoma virus ("RSV"), and metallothionein promoters, such 
rS as the mouse metallothionein- I promoter. 

".sis 

£ Selection of appropriate vectors and promoters for 

3 15 expression in a host cell is a well known procedure and the 

% requisite techniques for expression vector construction, 

Sj introduction of the vector into the host and expression in 

L. the host are routine skills in the art. 

Q The present invention also relates to host cells 

% 20 containing the above -described constructs. The host cell 

S can be a higher eukaryotic cell, such as a mammalian cell, 

^ or a lower eukaryotic cell, such as a yeast cell. 

Alternatively, the host cell can be a prokaryotic cell, 
such as a bacterial cell. 
25 Introduction of the construct into the host cell can 

be effected by calcium phosphate transf ect ion, DEAE-dextran 
mediated transf ection, cationic lipid-mediated 
transf ection, electroporation, transduction, infection or 
other methods. Such methods are described in many standard 
30 laboratory manuals, such as Davis et al . BASIC METHODS IN 
MOLECULAR BIOLOGY, (1986) . 

Constructs in host cells can be used in a 
conventional manner to produce the gene product encoded by 
the recombinant sequence. Alternatively, LSG polypeptides 
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of the invention can be synthetically produced by 
conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, 
yeast, bacteria, or other cells under the control of 
5 appropriate promoters. Cell-free translation systems can 
also be employed to produce such proteins using RNAs 
derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with 
prokaryotic and eukaryotic hosts are described by Sambrook 
10 et al. cited elsewhere herein. 

Generally, recombinant expression vectors will 
include origins of replication, a promoter derived from a 
highly-expressed gene to direct transcription of a 
downstream structural sequence, and a selectable marker to 
S 15 permit isolation of vector containing cells after exposure 

ip to the vector. Among suitable promoters are those derived 

y from the genes that encode glycolytic enzymes such as 3- 

O phosphoglycerate kinase ( " PGK" ) , a-f actor, acid 

ijj phosphatase, and heat shock proteins, among others. 

5 20 Selectable markers include the ampicillin resistance gene 

U of E. coli and the trpl gene of S. cerevisiae. 

Transcription of DNA encoding the LSG polypeptides of 
the present invention by higher eukaryotes may be increased 
by inserting an enhancer sequence into the vector. 
25 Enhancers are cis -acting elements of DNA, usually about 
from 10 to 300 base pairs (bp) that act to increase 
transcriptional activity of a promoter in a given host 
cell-type. Examples of enhancers include the SV40 
enhancer, which is located on the late side of the 
3 0 replication origin at bp 100 to 270, the cytomegalovirus 
early promoter enhancer, the polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 

A polynucleotide of the present invention, encoding a 
heterologous structural sequence of a LSG polypeptide of 
35 the present invention, generally will be inserted into the 
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vector using standard techniques so that it is operably 
linked to the promoter for expression. The polynucleotide 
will be positioned so that the transcription start site is 
located appropriately 5 ' to a ribosome binding site. The 
ribosome binding site will be 5 1 to the AUG that initiates 
translation of the polypeptide to be expressed. Generally, 
there will be no other open reading frames that begin with 
an initiation codon, usually AUG, lying between the 
ribosome binding site and the initiating AUG. Also, 
i generally, there will be a translation stop codon at the 
end of the polypeptide and there will be a polyadenylation 
signal and a transcription termination signal appropriately 
disposed at the 3' end of the transcribed region. 

Appropriate secretion signals may be incorporated 
5 into the expressed polypeptide for secretion of the 
translated protein into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the 
extracellular environment. The signals may be endogenous 
to the polypeptide or they may be heterologous signals. 
0 The polypeptide may be expressed in a modified form, 

such as a fusion protein, and may include not only 
secretion signals but also additional heterologous 
functional regions. Thus, for instance, a region of 
additional amino acids, particularly charged amino acids, 
5 may be added to the N- terminus of the polypeptide to 

improve stability and persistence in the host cell during 
purification or during subsequent handling and storage. A 
region also may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final 
0 preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate 
purification, among others, are familiar and routine 
techniques in the art . 
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Suitable prokaryotic hosts for propagation, 
maintenance or expression of LSG polynucleotides and 
polypeptides in accordance with the invention include 
Escherichia coli, Bacillus subtilis and Salmonella 
5 typhimurium. Various species of Pseudomonas , Streptomyces , 
and Staphylococcus are suitable hosts in this regard. Many 
other hosts also known to those of skill may also be 
employed in this regard. 

As a representative, but non- limiting example, useful 
10 expression vectors for bacterial use can comprise a 
selectable marker and bacterial origin of replication 
derived from commercially available plasmids comprising 
genetic elements of the well known cloning vector pBR322. 
Such commercial vectors include, for example, pKK223-3 
15 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 
(Promega Biotec, Madison, Wis., USA). These pBR322 
"backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 
Following transformation of a suitable host strain and 
2 0 growth of the host strain to an appropriate cell density, 
where the selected promoter is inducible it is induced by 
appropriate means (e.g., temperature shift or exposure to 
chemical inducer) and cells are cultured for an additional 
period. Cells typically then are harvested by 
25 centrifugation, disrupted by physical or chemical means, 
and the resulting crude extract retained for further 
purification. Microbial cells employed in expression of 
proteins can be disrupted by any convenient method, 
including freeze- thaw cycling, sonication, mechanical 
30 disruption, or use of cell lysing agents, such methods are 
well know to those skilled in the art. 

Various mammalian cell culture systems can be 
employed for expression, as well. An exemplary mammalian 
expression systems is the COS -7 line of monkey kidney 
35 fibroblasts described in Gluzman et al . , Cell 23: 175 
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(1981) . Other mammalian cell lines capable of expressing a 
compatible vector include for example, the C127, 3T3 , CHO, 
HeLa, human kidney 293 and BHK cell lines. Mammalian 
expression vectors comprise an origin of replication, a 
5 suitable promoter and enhancer, and any ribosome binding 
sites, polyadenylation sites, splice donor and acceptor 
sites, transcriptional termination sequences, and 5' 
flanking non-transcribed sequences that are necessary for 
expression. In certain preferred embodiments in this regard 

10 DNA sequences derived from the SV40 splice sites, and the 
SV40 polyadenylation sites are used for required non- 
transcribed genetic elements of these types. 

LSG polypeptides can be recovered and purified from 
recombinant cell cultures by well-known methods including 

15 ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Most preferably, high performance 

2 0 liquid chromatography ("HPLC") is employed for 

purification. Well known techniques for refolding proteins 
may be employed to regenerate active conformation when the 
polypeptide is denatured during isolation and or 
purification . 

25 LSG polypeptides of the present invention include 

naturally purified products, products of chemical synthetic 
procedures, and products produced by recombinant techniques 
from a prokaryotic or eukaryotic host, including, for 
example, bacterial, yeast, higher plant, insect and 

30 mammalian cells. Depending upon the host employed in a 

recombinant production procedure, the LSG polypeptides of 
the present invention may be glycosylated or may be non- 
glycosylated. In addition, LSG polypeptides of the 
invention may also include an initial modified methionine 
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residue, in some cases as a result of host -mediated 
processes . 

LSG polynucleotides and polypeptides may be used in 
accordance with the present invention for a variety of 
5 applications, particularly those that make use of the 

chemical and biological properties of the LSGs . Additional 
applications relate to diagnosis and to treatment of 
disorders of cells, tissues and organisms. These aspects of 
the invention are illustrated further by the following 

10 discussion. 

Polynucleotide assays 

As discussed in some detail supra, this invention is 
also related to the use of LSG polynucleotides to detect 
complementary polynucleotides such as, for example, as a 

15 diagnostic reagent. Detection of a mutated form of LSG 
associated with a dysfunction will provide a diagnostic 
tool that can add to or define a diagnosis of a disease or 
susceptibility to a disease which results from under- 
expression, over-expression or altered expression of a LSG, 

2 0 such as, for example, a susceptibility to inherited lung 

cancer . 

Individuals carrying mutations in a human LSG gene 
may be detected at the DNA level by a variety of 
techniques. Nucleic acids for diagnosis may be obtained 
25 from a patient's cells, such as from blood, urine, saliva, 
tissue biopsy and autopsy material. The genomic DNA may be 
used directly for detection or may be amplified 
enzymatically using PCR prior to analysis (Saiki et al . , 
Nature, 324: 163-166 (1986)) . RNA or cDNA may also be used 

3 0 in a similar manner. As an example, PCR primers 

complementary to a LSG polynucleotide of SEQ ID NO: 1, 2, 
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 37 or 38 can be used to identify and 
analyze LSG expression and mutations. For example, 
35 deletions and insertions can be detected by a change in 
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size of the amplified product in comparison to the normal 
genotype. Point mutations can be identified by hybridizing 
amplified DNA to radiolabeled LSG RNA or alternatively, 
radiolabeled LSG antisense DNA sequences. Perfectly 
5 matched sequences can be distinguished from mismatched 

duplexes by RNase A digestion or by differences in melting 
temperatures . 

Sequence differences between a reference gene and 
genes having mutations also may be revealed by direct DNA 
10 sequencing. In addition, cloned DNA segments may be 

employed as probes to detect specific DNA segments. The 
sensitivity of such methods can be greatly enhanced by 
appropriate use of PCR or another amplification method. 
For example, a sequencing primer is used with double- 
15 stranded PCR product or a single-stranded template molecule 
generated by a modified PCR. The sequence determination is 
performed by conventional procedures with radiolabeled 
nucleotide or by automatic sequencing procedures with 
fluorescent-tags . 
20 Genetic testing based on DNA sequence differences may 

be achieved by detection of alterations in electrophoretic 
mobility of DNA fragments in gels, with or without 
denaturing agents. Small sequence deletions and insertions 
can be visualized by high resolution gel electrophoresis. 
25 DNA fragments of different sequences may be distinguished 
on denaturing formamide gradient gels in which the 
mobilities of different DNA fragments are retarded in the 
gel at different positions according to their specific 
melting or partial melting temperatures (see, e.g., Myers 
30 et al., Science, 230: 1242 (1985)). 

Sequence changes at specific locations also may be 
revealed by nuclease protection assays, such as RNase and 
SI protection or the chemical cleavage method (e.g., Cotton 
et al., Proc. Natl. Acad. Sci . , USA, 85: 4397-4401 (1985)). 
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Thus, the detection of a specific DNA sequence may be 
achieved by methods such as hybridization, RNase 
protection, chemical cleavage, direct DNA sequencing or the 
use of restriction enzymes, (e.g., restriction fragment 
length polymorphisms ( "RFLP" ) and Southern blotting of 
genomic DNA. In addition to more conventional gel- 
electrophoresis and DNA sequencing, mutations also can be 
detected by in situ analysis. 
Chromosome assays 

The LSG sequences of the present invention are also 
valuable for chromosome identification. There is a need 
for identifying particular sites on the chromosome and few 
chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking 
chromosomal location. Each LSG sequence of the present 
invention is specifically targeted to and can hybridize 
with a particular location on an individual human 
chromosome. Thus, the LSGs can be used in the mapping of 
DNAs to chromosomes, an important first step in correlating 
sequences with genes associated with disease. 

In certain preferred embodiments in this regard, the 
cDNA herein disclosed is used to clone genomic DNA of a LSG 
of the present invention. This can be accomplished using a 
variety of well known techniques and libraries, which 
generally are available commercially. The genomic DNA is 
used for in situ chromosome mapping using well known 
techniques for this purpose. 

in some cases, sequences can be mapped to chromosomes 
by preparing PCR primers (preferably 15-25 bp) from the 
cDNA. Computer analysis of the 3' untranslated region of 
the gene is used to rapidly select primers that do not span 
more than one exon in the genomic DNA, thus complicating 
the amplification process. These primers are then used for 
PCR screening of somatic cell hybrids containing individual 
human chromosomes. Only those hybrids containing the human 
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gene corresponding to the primer will yield an amplified 
fragment . 

PCR mapping of somatic cell hybrids is a rapid 
procedure for assigning a particular DNA to a particular 
5 chromosome. Using the present invention with the same 
oligonucleotide primers, sublocalizat ion can be achieved 
with panels of fragments from specific chromosomes or pools 
of large genomic clones in an analogous manner. Other 
mapping strategies that can similarly be used to map to its 
10 chromosome include in situ hybridization, prescreening with 
labeled flow- sorted chromosomes and preselection by 
hybridization to construct chromosome specif ic- cDNA 
libraries . 

Fluorescence in situ hybridization ("FISH") of a cDNA 

15 clone to a metaphase chromosomal spread can be used to 

provide a precise chromosomal location in one step. This 
technique can be used with cDNA as short as 50 or 60 bp. 
This technique is described by Verma et al . (HUMAN 
CHROMOSOMES: A MANUAL OF BASIC TECHNIQUES, Pergamon Press, 

20 New York (1988) ) . 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the sequence 
on the chromosome can be correlated with genetic map data. 
Such data are found, for example, in V. McKusick, MENDELIAN 

25 INHERITANCE IN MAN, available on line through Johns Hopkins 
University, Welch Medical Library. The relationship 
between genes and diseases that have been mapped to the 
same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes) . 

30 Next, it is necessary to determine the differences in 

the cDNA or genomic sequence between affected and 
unaffected individuals. If a mutation is observed in some 
or all of the affected individuals but not in any normal 
individuals, then the mutation is likely to be the 

35 causative agent of the disease. 
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With current resolution of physical mapping and 
genetic mapping techniques, a cDNA precisely localized to a 
chromosomal region associated with the disease could be one 
of between 50 and 500 potential causative genes. (This 
assumes 1 megabase mapping resolution and one gene per 20 
kb) . 

Polypeptide assays 

As described in some detail supra, the present 
invention also relates to diagnostic assays such as 
quantitative and diagnostic assays for detecting levels of 
LSG polypeptide in cells and tissues, and biological fluids 
such as blood and urine, including determination of normal 
and abnormal levels. Thus, for instance, a diagnostic assay 
in accordance with the present invention for detecting 
over-expression or under-expression of a LSG polypeptide 
compared to normal control tissue samples may be used to 
detect the presence of neoplasia. Assay techniques that 
can be used to determine levels of a protein, such as a LSG 
polypeptide of the present invention, in a sample derived 
0 from a host are well-known to those of skill in the art. 
Such assay methods include radioimmunoassays, competitive- 
binding assays, Western Blot analysis and EL ISA assays. 
Among these ELISAs frequently are preferred. 

For example, antibody- sandwich ELISAs are used to 
5 detect polypeptides in a sample, preferably a biological 
sample. Wells of a microtiter plate are coated with 
specific antibodies, at a final concentration of 0.2 to 10 
/xg/ml. The antibodies are either monoclonal or polyclonal 
and are produced by methods as described herein. The wells 
0 are blocked so that non-specific binding of the polypeptide 
to the well is reduced. The coated wells are then 
incubated for > 2 hours at room temperature with a sample 
containing the LSG polypeptide. Preferably, serial 
dilutions of the sample should be used to validate results. 
5 The plates are then washed three times with deionized or 
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distilled water to remove unbounded polypeptide. Next, 50 
fil of specific antibody-alkaline phosphatase conjugate, at 
a concentration of 25-400 ng, is added and incubated for 2 
hours at room temperature. The plates are again washed 
5 three times with deionized or distilled water to remove 
unbounded conjugate. 4 -methylumbellif eryl phosphate (MUP) 
or p-nitrophenyl phosphate (NPP) substrate solution (75^x1) 
is then added to each well and the plate is incubated 1 
hour at room temperature. The reaction is measured by a 

10 microtiter plate reader. A standard curve is prepared using 
serial dilutions of a control sample, and polypeptide 
concentration is plotted on the X-axis (log scale) while 
fluorescence or absorbance is plotted on the Y-axis (linear 
scale) . The concentration of the LSG polypeptide in the 

15 sample is interpolated using the standard curve. 
Antibodies 

As discussed in some detail supra, LSG polypeptides, 
their fragments or other derivatives, or analogs thereof, 
or cells expressing them can be used as an immunogen to 
20 produce antibodies thereto. These antibodies can be 

polyclonal or monoclonal antibodies. The present invention 
also includes chimeric, single chain, and humanized 
antibodies, as well as Fab fragments, or the product of an 
Fab expression library. Various procedures known in the art 

2 5 may be used for the production of such antibodies and 

fragments . 

A variety of methods for antibody production are set 
forth in Current Protocols, Chapter 2. 

For example, cells expressing a LSG polypeptide of 

3 0 the present invention can be administered to an animal to 

induce the production of sera containing polyclonal 
antibodies. In a preferred method, a preparation of the 
secreted protein is prepared and purified to render it 
substantially free of natural contaminants. This 
3 5 preparation is then introduced into an animal in order to 
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produce polyclonal antisera of greater specific activity. 
The antibody obtained will bind with the LSG polypeptide 
itself. In this manner, even a sequence encoding only a 
fragment of the LSG polypeptide can be used to generate 
antibodies binding the whole native polypeptide. Such 
antibodies can then be used to isolate the LSG polypeptide 
from tissue expressing that LSG polypeptide. 

Alternatively, monoclonal antibodies can be prepared. 
Examples of techniques for production of monoclonal 
antibodies include, but are not limited to, the hybridoma 
technique (Kohler, G. and Milstein, C, Nature 256: 495-497 
(1975) , the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al . , Immunology Today 4: 72 (1983) and 
(Cole et al., pg. 77-96 in MONOCLONAL ANTIBODIES AND CANCER 
1 THERAPY, Alan R. Liss, Inc. (1985). The EBV-hybridoma 
technique is useful in production of human monoclonal 
antibodies . 

Hybridoma technologies have also been described by 
Khler et al . (Eur. J. Immunol. 6: 511 (1976)) Khler et al . 

0 (Eur. J. Immunol. 6: 292 (1976)) and Hammerling et al . (in: 
Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N. 
Y., pp. 563-681 (1981)). In general , such procedures 
involve immunizing an animal (preferably a mouse) with LSG 
polypeptide or, more preferably, with a secreted LSG 

5 polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is 
preferable to culture cells in Earle's modified Eagle's 
medium supplemented with 10% fetal bovine serum 
(inactivated at about 56°C) , and supplemented with about 10 

0 g/1 of nonessential amino acids, about 1,000 U/ml of 
penicillin, and about 100 ng/ml of streptomycin. The 
splenocytes of such mice are extracted and fused with a 
suitable myeloma cell line. Any suitable myeloma cell line 
may be employed in accordance with the present invention; 

5 however, it is preferable to employ the parent myeloma cell 
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line (SP20) , available from the ATCC . After fusion, the 
resulting hybridoma cells are selectively maintained in HAT 
medium, and then cloned by limiting dilution as described 
by Wands et al . (Gastroenterology 80: 225-232 (1981).). 

5 The hybridoma cells obtained through such a selection are 
then assayed to identify clones which secrete antibodies 
capable of binding the polypeptide. 

Alternatively, additional antibodies capable of 
binding to the polypeptide can be produced in a two-step 

10 procedure using anti - idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves 
antigens, and therefore, it is possible to obtain an 
antibody which binds to a second antibody. In accordance 
with this method, protein specific antibodies are used to 

15 immunize an animal, preferably a mouse. The splenocytes of 
such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones 
which produce an antibody whose ability to bind to the 
protein-specific antibody can be blocked by the 

20 polypeptide. Such antibodies comprise anti - idiotypic 

antibodies to the protein specific antibody and can be used 
to immunize an animal to induce formation of further 
protein-specific antibodies . 

Techniques described for the production of single 

25 chain antibodies (U.S. Patent 4,946,778) can also be 

adapted to produce single chain antibodies to immunogenic 
polypeptide products of this invention. Also, transgenic 
mice, as well as other nonhuman transgenic animals, may be 
used to express humanized antibodies to immunogenic 

30 polypeptide products of this invention. 

It will be appreciated that Fab, F(ab')2 and other 
fragments of the antibodies of the present invention may 
also be used according to the methods disclosed herein. 
Such fragments are typically produced by proteolytic 

3 5 cleavage, using enzymes such as papain (to produce Fab 
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fragments) or pepsin (to produce F(ab')2 fragments). 
Alternatively, secreted protein-binding fragments can be 
produced through the application of recombinant DNA 
technology or through synthetic chemistry. 
5 For in vivo use of antibodies in humans, it may be 

preferable to use "humanized" chimeric monoclonal 
antibodies. Such antibodies can be produced using genetic 
constructs derived from hybridoma cells producing the 
monoclonal antibodies described above. Methods for 
10 producing chimeric antibodies are known in the art (See, 
for review, Morrison, Science 229: 1202 (1985); Oi et al . , 
BioTechniques 4: 214 (1986); Cabilly et al . , U. S. Patent 
4,816,567; Taniguchi et al . , EP 171496; Morrison et al . , EP 
173494; Neuberger et al . , WO 8601533; Robinson et al . , WO 
15 8702671; Boulianne et al . , Nature 312: 643 (1984); 
Neuberger et al . , Nature 314: 268 (1985).) 

The above-described antibodies may be employed to 
isolate or to identify clones expressing LSG polypeptides 
or purify LSG polypeptides of the present invention by 
20 attachment of the antibody to a solid support for isolation 
and/or purification by affinity chromatography. As 
discussed in more detail supra, antibodies specific against 
a LSG may also be used to image tumors, particularly cancer 
of the lung, in patients suffering from cancer. Such 
25 antibodies may also be used therapeutically to target 
tumors expressing a LSG. 

Preferred exemplary antigenic epitopes of LSGs of the 
present invention which have been identified are depicted 
below. The antigenicity index (Al avg) used is Jameson - 
30 Wolf. In some embodiment, it may be preferred to raise 
antibodies against these regions of the LSGs. 

LSG of SEQ ID NO: 39 

positions Al avg length 

176-220 1.37 45 

35 399-410 1 . 18 12 

301-317 1 . 13 17 
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370-391 1.13 22 

23-34 1.07 12 

149-174 1.00 26 

51-67 1.00 17 
5 LSG of SEQ ID NO: 42 

positions AI avg length 

453-465 1.25 13 

399-409 1.25 11 

572-584 1.20 13 

10 874-887 1-18 14 

226-235 1.15 10 

30-51 1.09 22 

910-920 1.07 11 

991-1010 1.06 20 

15 655-668 1.06 14 

362-373 1.00 12 
LSG of SEQ ID NO: 44 

positions AI avg length 

134-160 1.23 27 

20 415-436 1.17 22 

485-515 1.16 31 

459-474 1.10 16 

200-210 1.08 11 

535-562 1.04 28 

25 91-115 1.04 25 

523-532 1.02 10 

8-20 1.01 13 
LSG of SEQ ID NO: 45 

positions AI avg length 

30 563-586 1.19 24 

395-408 1.09 14 

130-139 1 . 04 10 

117-127 1 . 02 11 

165-189 1 . 01 25 
35 LSG of SEQ ID NO:46 

positions AI avg length 

122-137 1.10 16 
LSG of SEQ ID NO: 47 

positions AI avg length 

40 1045-1054 1.12 10 

845-880 1.10 36 

919-945 1.10 27 

1376-1418 1 . 10 43 

144-164 1.10 21 

45 814-835 1.09 22 

706-755 1.06 50 

401-416 1.05 16 

445-491 1.04 47 

1061-1085 1 . 03 25 

50 422-442 1.02 21 
LSG of SEQ ID NO: 48 
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positions 

340-362 
155-164 
228-240 
5 3-14 

LSG of SEQ ID NO: 49 
positions 
189-204 
LSG of SEQ ID NO: 50 
10 positions 
134-143 
23-45 
LSG of SEQ ID NO: 51 
positions 
15 53-68 

LSG of SEQ ID NO: 53 
positions 
367-392 
491-504 
20 14-35 

275-284 
208-219 
439-456 
LSG of SEQ ID NO: 54 
25 positions 
1671-1681 
453-465 
1748-1759 
1725-1738 
30 1804-1825 
1644-1655 
1281-1295 
1532-1545 
1351-1369 
35 1040-1062 
1334-1347 
145-155 
1121-1132 
1307-1318 
40 1376-1408 
650-660 
802-823 
714-735 
1885-1898 
45 1967-1976 

LSG of SEQ ID NO: 55 
positions 
297-311 
328-344 
50 16-25 
96-113 



AI avg 

1 . 05 
1 . 01 
1.00 
1. 00 

AI avg 

1 . 08 

AI avg 

1 .21 
1 . 01 

AI avg 

1 . 14 

AI avg 

1 .32 
07 
04 
03 
03 
02 



1 . 

1 , 
1 . 
1 . 
1 



AI avg 

1.35 
26 
23 
, 19 
, 15 
.13 
.12 
. 11 
. 07 
. 06 
. 05 
. 05 
. 05 
. 02 
. 02 
01 
00 
00 
00 
00 



1 
1 
1 
1 

1 . 

1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 , 
1 . 
1 , 
1 
1 



AI avg 

1.31 
1.25 
1.20 
1 . 12 



length 

23 
10 
13 
12 

length 

16 

length 

10 

23 

length 

16 

length 

26 
14 
22 
10 
12 
18 

length 

11 
13 
12 
14 
22 
12 
15 
14 
19 
23 
14 
11 
12 
12 
33 
11 
22 
22 
14 
10 

length 

15 
17 
10 
18 
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381-393 


1 . 12 


J- 3 


236-250 


1 . 1U 




354-364 


1 .09 


11 


441-451 


1 . 07 


11 


274-291 


1 .00 


18 


LSG of SEQ ID NO: 56 






positions 


A I avg 


length 


197-210 


1.03 


14 


318-328 


1 . 02 


11 



10 



LSG binding molecules and assays 

This invention also provides a method for 
identification of molecules, such as receptor molecules, 
that bind LSGs . Genes encoding proteins that bind LSGs, 
p 15 such as receptor proteins, can be identified by numerous 

;| methods known to those of skill in the art. Examples 

S include, but are not limited to, ligand panning and FACS 

'fj sorting. Such methods are described in many laboratory 

OH manuals such as, for instance, Coligan et al . , Current 

y 20 Protocols in Immunology 1(2): Chapter 5 (1991). 

□ Expression cloning may also be employed for this 

In purpose. To this end, polyadenylated RNA is prepared from 

□ a cell responsive to a LSG of the present invention. A cDNA 
library is created from this RNA and the library is divided 

25 into pools. The pools are then transfected individually 
into cells that are not responsive to a LSG of the present 
invention. The transfected cells then are exposed to 
labeled LSG. LSG polypeptides can be labeled by a variety 
of well-known techniques including, but not limited to, 

30 standard methods of radio- iodinat ion or inclusion of a 
recognition site for a site-specific protein kinase. 
Following exposure, the cells are fixed and binding of 
labeled LSG is determined. These procedures conveniently 
are carried out on glass slides. Pools containing labeled 

35 LSG are identified as containing cDNA that produced LSG- 
binding cells. Sub-pools are then prepared from these 
positives, transfected into host cells and screened as 
described above. Using an iterative sub-pooling and re- 
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screening process, one or more single clones that encode 
the putative binding molecule, such as a receptor molecule, 
can be isolated. 

Alternatively a labeled ligand can be photoaf f inity 
5 linked to a cell extract, such as a membrane or a membrane 
extract, prepared from cells that express a molecule that 
it binds, such as a receptor molecule. Cross- linked 
material is resolved by polyacrylamide gel electrophoresis 
("PAGE") and exposed to X-ray film. The labeled complex 

10 containing the ligand-receptor can be excised, resolved 
into peptide fragments, and subjected to protein 
microsequencing. The amino acid sequence obtained from 
microsequencing can be used to design unique or degenerate 
oligonucleotide probes to screen cDNA libraries to identify 

15 genes encoding the putative receptor molecule. 

Polypeptides of the invention also can be used to 
assess LSG binding capacity of LSG binding molecules, such 
as receptor molecules, in cells or in cell-free 
preparations . 

2 0 Agonists and antagonists - assays and molecules 

The invention also provides a method of screening 
compounds to identify those which enhance or block the 
action of a LSG on cells. By "compound", as used herein, 
it is meant to be inclusive of small organic molecules, 
25 peptides, polypeptides and antibodies as well as any other 
candidate molecules which have the potential to enhance or 
agonize or block or antagonize the action of LSG on cells. 
As used herein, an agonist is a compound which increases 
the natural biological functions of a LSG or which 

3 0 functions in a manner similar to a LSG, while an 

antagonist, as used herein, is a compound which decreases 
or eliminates such functions. Various known methods for 
screening for agonists and/or antagonists can be adapted 
for use in identifying LSG agonist or antagonists. 
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For example, a cellular compartment, such as a 
membrane or a preparation thereof, such as a membrane- 
preparation, may be prepared from a cell that expresses a 
molecule that binds a LSG, such as a molecule of a 
5 signaling or regulatory pathway modulated by LSG. The 

preparation is incubated with labeled LSG in the absence or 
the presence of a compound which may be a LSG agonist or 
antagonist. The ability of the compound to bind the 
binding molecule is reflected in decreased binding of the 

10 labeled ligand. Compounds which bind gratuitously, i.e., 
without inducing the effects of a LSG upon binding to the 
LSG binding molecule are most likely to be good 
antagonists. Compounds that bind well and elicit effects 
that are the same as or closely related to LSG are 

15 agonists. LSG-like effects of potential agonists and 

antagonists may by measured, for instance, by determining 
activity of a second messenger system following interaction 
of the candidate molecule with a cell or appropriate cell 
preparation, and comparing the effect with that of LSG or 

20 molecules that elicit the same effects as LSG. Second 
messenger systems that may be useful in this regard 
include, but are not limited to, AMP guanylate cyclase, ion 
channel or phosphoinositide hydrolysis second messenger 
systems . 

25 Another example of an assay for LSG antagonists is a 

competitive assay that combines LSG and a potential 
antagonist with membrane -bound LSG receptor molecules or 
recombinant LSG receptor molecules under appropriate 
conditions for a competitive inhibition assay. LSG can be 

3 0 labeled, such as by radioactivity, such that the number of 
LSG molecules bound to a receptor molecule can be 
determined accurately to assess the effectiveness of the 
potential antagonist. 

Potential antagonists include small organic 

3 5 molecules, peptides, polypeptides and antibodies that bind 
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to a LSG polypeptide of the invention and thereby inhibit 
or extinguish its activity. Potential antagonists also may 
be small organic molecules, a peptide, a polypeptide such 
as a closely related protein or antibody that binds the 
5 same sites on a binding molecule, such as a receptor 

molecule, without inducing LSG- induced activities, thereby 
preventing the action of LSG by excluding LSG from binding. 

Potential antagonists include small molecules which 
bind to and occupy the binding site of the LSG polypeptide 

10 thereby preventing binding to cellular binding molecules, 
such as receptor molecules, such that normal biological 
activity is prevented. Examples of small molecules include 
but are not limited to small organic molecules, peptides or 
peptide-like molecules. 

15 Other potential antagonists include antisense 

molecules. Antisense technology can be used to control gene 
expression through antisense DNA or RNA or through triple- 
helix formation. Antisense techniques are discussed, for 
example, in Okano, J. Neurochem. 56: 560 (1991); 

2 0 OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE 

EXPRESSION, CRC Press, Boca Raton, Fla. (1988) . Triple 
helix formation is discussed in, for instance Lee et al . , 
Nucleic Acids Research 6: 3073 (1979); Cooney et al . , 
Science 241: 456 (1988); and Dervan et al . , Science 251: 
25 1360 (1991) . The methods are based on binding of a 

polynucleotide to a complementary DNA or RNA. For example, 
the 5' coding portion of a polynucleotide that encodes a 
mature LSG polypeptide of the present invention may be used 
to design an antisense RNA oligonucleotide of from about 10 

3 0 to 40 base pairs in length. A DNA oligonucleotide is 

designed to be complementary to a region of the gene 
involved in transcription thereby preventing transcription 
and the production of a LSG polypeptide. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks 
35 translation of the mRNA molecule into a LSG polypeptide. 
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The oligonucleotides described above can also be delivered 
to cells such that the ant i sense RNA or DNA may be 
expressed in vivo to inhibit production of a LSG. 
Composi tions 

The present invention also relates to compositions 
comprising a LSG polynucleotide or a LSG polypeptide or an 
agonist or antagonist thereof. 

For example, a LSG polynucleotide, polypeptide or an 
agonist or antagonist thereof of the present invention may 
be employed in combination with a non- sterile or sterile 
carrier or carriers for use with cells, tissues or 
organisms, such as a pharmaceutical carrier suitable for 
administration to a subject- Such compositions comprise, 
for instance, a media additive or a therapeutically 
effective amount of a polypeptide of the invention and a 
pharmaceutical^ acceptable carrier or excipient . Such 
carriers may include, but are not limited to, saline, 
buffered saline, dextrose, water, glycerol, ethanol and 
combinations thereof. The formulation should suit the mode 
of administration. 

Compositions of the present invention will be 
formulated and dosed in a fashion consistent with good 
medical practice, taking into account the clinical 
condition of the individual patient (especially the side 
effects of treatment with the polypeptide or other compound 
alone) , the site of delivery, the method of administration, 
the scheduling of administration, and other factors known 
to practitioners. The "effective amount" for purposes 
herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutical^ 
effective amount of secreted polypeptide administered 
parenterally per dose will be in the range of about 1, 
/xg/kg/day to 10 mg/kg/day of patient body weight, although, 
as noted above, this will be subject to therapeutic 
discretion. More preferably, this dose is at least 0.01 
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mg/kg/day, and most preferably for humans between about 
0.01 and 1 mg/kg/day for the hormone. If given 
continuously, the polypeptide or other compound is 
typically administered at a dose rate of about 1 ^g/kg/hour 
5 to about 50 mg/kg/hour, either by 1-4 injections per day or 
by continuous subcutaneous infusion, for example, using a 
mini -pump. An intravenous bag solution may also be 
employed. The length of treatment needed to observe changes 
and the interval following treatment for responses to occur 
10 appears to vary depending on the desired effect. 

Pharmaceutical compositions containing the secreted 
protein of the invention are administered orally, rectally, 
parenterally, intracistemally , intravaginally , 
intraperitoneally, topically (as by powders, ointments, 
15 gels, drops or transdermal patch), bucally, or as an oral 
or nasal spray. " Pharmaceut ically acceptable carrier- 
refers to a non-toxic solid, semisolid or liquid filler, 
diluent, encapsulating material or formulation auxiliary of 
any type. The term "parenteral" as used herein refers to 
20 modes of administration which include intravenous, 

intramuscular , intraperitoneal , intrasternal , subcutaneous 
and intraarticular injection and infusion. 

The polypeptide or other compound is also suitably 
administered by sustained-release systems. Suitable 
25 examples of sustained-release compositions include 
semipermeable polymer matrices in the form of shaped 
articles, e. g., films, or microcapsules . Sustained-release 
matrices include polylactides (U.S. Patent 3,773,919 and EP 
58481) , copolymers of L-glutamic acid and gamma- ethyl -L- 
30 glutamate (Sidman, U. et al . , Biopolymers 22: 547-556 

(1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et 
al., J. Biomed. Mater. Res. 15: 167-277 (1981), and R. 
Langer, Chem. Tech. 12: 98-105 (1982)), ethylene vinyl 
acetate (R. Langer et al . ) and poly-D- ( - ) - 3 -hydroxybutyric 
35 acid (EP 133,988). Sustained-release compositions also 
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include liposomally entrapped polypeptides. Liposomes 
containing the polypeptide or other compound are prepared 
by well known methods (Epstein et al . , Proc . Natl. Acad. 
Sci. USA 82: 3688-3692 (1985); Hwang et al . , Proc. Natl. 
5 Acad. Sci. USA 77: 4030-4034 (1980); EP 52322; EP 36676; EP 
88046; EP 143949; EP 142641; Japanese Pat. Appl . 83-118008; 
U.S. Patent 4,485,045 and 4,544,545; and EP 102324). 
Ordinarily, the liposomes are of the small (about 200-800 
Angstroms) unilamellar type in which the lipid content is 
10 greater than about 30 mol . percent cholesterol, the 

selected proportion being adjusted for the optimal therapy. 

For parenteral administration, in one embodiment, the 
polypeptide or other compound is formulated generally by 
mixing it at the desired degree of purity, in a unit dosage 
15 injectable form (solution, suspension, or emulsion), with a 
£j pharmaceutical^ acceptable carrier, i.e., one that is non- 

sj toxic to recipients at the dosages and concentrations 

jU employed and is compatible with other ingredients of the 

iasf 

%J formulation . 

^ 20 For example, the formulation preferably does not 

5 include oxidizing agents and other compounds that are known 

114 to be deleterious to the polypeptide or other compound. 

Generally, the formulations are prepared by 
contacting the polypeptide or other compound uniformly and 
25 intimately with liquid carriers or finely divided solid 

carriers or both. Then, if necessary, the product is shaped 
into the desired formulation. Preferably the carrier is a 
parenteral carrier, more preferably a solution that is 
isotonic with the blood of the recipient. Examples of such 
30 carrier vehicles include water, saline, Ringer's solution, 
and dextrose solution. Non-aqueous vehicles such as fixed 
oils and ethyl oleate are also useful herein, as well as 
liposomes . 

The carrier suitably contains minor amounts of 
35 additives such as substances that enhance isotonicity and 
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chemical stability. Such materials are non-toxic to 
recipients at the dosages and concentrations employed, and 
include buffers such as phosphate, citrate, succinate, 
acetic acid, and other organic acids or their salts; 
5 antioxidants such as ascorbic acid; low molecular weight 
(less than about ten residues) polypeptides, e. g., 
polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers 
such as polyvinylpyrrolidone; amino acids, such as glycine, 
10 glutamic acid, aspartic acid, or arginine; monosaccharides, 
disaccharides, and other carbohydrates including cellulose 

4f or its derivatives, glucose, mannose, or dextrins; 

3 chelating agents such as EDTA ,* sugar alcohols such as 

mannitol or sorbitol; counterions such as sodium; and/or 

IH 15 nonionic surfactants such as polysorbates , poloxamers, or 

~j PEG. 

s The polypeptide or other compound is typically 

H formulated in such vehicles at a concentration of about 0.1 

Ri mg/ml to 100 mg/ml , preferably 1-10 mg/ml, at a pH of about 

S 20 3 to 8. It will be understood that the use of certain of 

N= the foregoing excipients, carriers, or stabilizers will 

result in the formation of polypeptide salts or salts of 
the other compounds . 

Any polypeptide to be used for therapeutic 
25 administration should be sterile. Sterility is readily 
accomplished by filtration through sterile filtration 
membranes (e. g., 0.2 micron membranes). Therapeutic 
polypeptide compositions generally are placed into a 
container having a sterile access port, for example, an 
30 intravenous solution bag or vial having a stopper 
pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or 
mult i -dose containers, for example, sealed ampules or 
vials, as an aqueous solution or as a lyophilized 
35 formulation for reconst itut ion . As an example of a 
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lyophilized formulation, 10-ml vials are filled with 5 ml 
of sterile-filtered 1 % (w/v) aqueous polypeptide solution, 
and the resulting mixture is lyophilized. The infusion 
solution is prepared by reconstituting the lyophilized 
5 polypeptide using bacteriostatic Water-f or- Inj ection . 
Kits 

The invention further relates to pharmaceutical packs 
and kits comprising one or more containers filled with one 
or more of the ingredients of the aforementioned 

10 compositions of the invention. Associated with such 

container (s) can be a notice in the form prescribed by a 
governmental agency regulating the manufacture, use or sale 
of pharmaceuticals or biological products, reflecting 
approval by the agency of the manufacture, use or sale of 

15 the product for human administration. 
Administration 

LSG polypeptides or polynucleotides or other 
compounds, preferably agonists or antagonists thereof of 
the present invention may be employed alone or in 

20 conjunction with other compounds, such as therapeutic 
compounds . 

The pharmaceutical compositions may be administered 
in any effective, convenient manner including, for 
instance, administration by topical, oral, anal, vaginal, 

2 5 intravenous, intraperitoneal, intramuscular, subcutaneous, 

intranasal or intradermal routes among others. 

The pharmaceutical compositions generally are 
administered in an amount effective for treatment or 
prophylaxis of a specific indication or indications. In 

3 0 general, the compositions are administered in an amount of 

at least about 10 //g/kg body weight. However, it will be 
appreciated that optimum dosage will be determined by 
standard methods for each treatment modality and 
indication, taking into account the indication, its 
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severity, route of administration, complicating conditions 
and the like. 

It will be appreciated that conditions caused by a . 
decrease in the standard or normal expression level of a 

5 LSG polypeptide in an individual can be treated by 

administering the LSG polypeptide of the present invention, 
preferably in the secreted form, or an agonist thereof. 
Thus, the invention also provides a method of treatment of 
an individual in need of an increased level of a LSG 

10 polypeptide comprising administering to such an individual 
a pharmaceutical composition comprising an amount of the 
LSG polypeptide or an agonist thereof to increase the 
activity level of the LSG polypeptide in such an 
individual. For example, a patient with decreased levels 

15 of a LSG polypeptide may receive a daily dose 0.1-100 M9/ k 9 
of a LSG polypeptide or agonist thereof for six consecutive 
days. Preferably, if a LSG polypeptide is administered it 
is in the secreted form. 

Compositions of the present invention can also be 

20 administered to treating increased levels of a LSG 

polypeptide. For example, antisense technology can be used 
to inhibit production of a LSG polypeptide of the present 
invention. This technology is one example of a method of 
decreasing levels of a polypeptide, preferably a secreted 

25 form, due to a variety of etiologies, such as cancer. A 
patient diagnosed with abnormally increased levels of a 
polypeptide can be administered intravenously antisense 
polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0 mg/kg day for 
21 days. This treatment is preferably repeated after a 7- 

30 day rest period if the treatment was well tolerated. 

Compositions comprising an antagonist of a LSG polypeptide 
can also be administered to decrease levels of LSG in a 
patient . 
Gene therapy 
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The LSG polynucleotides, polypeptides, agonists and 
antagonists that are polypeptides may be employed in 
accordance with the present invention by expression of such 
polypeptides in vivo, in treatment modalities often 
5 referred to as "gene therapy." 

Thus, for example, cells from a patient may be 
engineered with a polynucleotide, such as a DNA or RNA, 
encoding a polypeptide ex vivo, and the engineered cells 
then can be provided to a patient to be treated with the 
10 polypeptide. For example, cells may be engineered ex vivo 
by the use of a retroviral plasmid vector containing RNA 
encoding a polypeptide of the present invention. Such 
methods are well-known in the art and their use in the 
present invention will be apparent from the teachings 
15 herein. 

Similarly, cells may be engineered in vivo for 
expression of a polypeptide in vivo by procedures known in 
the art. For example, a polynucleotide of the invention 
may be engineered for expression in a replication defective 

20 retroviral vector, as discussed supra. The retroviral 
expression construct then may be isolated and introduced 
into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present 
invention such that the packaging cell now produces 

25 infectious viral particles containing the gene of interest. 
These producer cells may be administered to a patient for 
engineering cells in vivo and expression of the polypeptide 
in vivo. These and other methods for administering a 
polypeptide of the present invention would be apparent to 

30 those skilled in the art upon reading the instant 
application . 

Retroviruses from which the retroviral plasmid 
vectors herein above mentioned may be derived include, but 
are not limited to, Moloney Murine Leukemia Virus, spleen 

3 5 necrosis virus, retroviruses such as Rous Sarcoma Virus, 
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Harvey Sarcoma Virus, avian leukosis virus, gibbon ape 
leukemia virus, human immunodeficiency virus, adenovirus, 
Myeloproliferative Sarcoma Virus, and mammary tumor virus. 
In one embodiment, the retroviral plasmid vector is derived 
5 from Moloney Murine Leukemia Virus. 

Such vectors will include one or more promoters for 
expressing the polypeptide. The selection of a suitable 
promoter will be apparent to those skilled in the art from 
the teachings contained herein. However, examples of 

10 suitable promoters which may be employed include, but are 
not limited to, the retroviral LTR, the SV40 promoter, the 
human cytomegalovirus (CMV) promoter described in Miller et 
al., Biotechniques 7: 980-990 (1989), and eukaryotic 
cellular promoters such as the histone, RNA polymerase III, 

15 and beta-actin promoters. Other viral promoters which may 
be employed include, but are not limited to, adenovirus 
promoters, thymidine kinase (TK) promoters, and B19 
parvovirus promoters. Additional promoters which may be 
used include respiratory syncytial virus (RSV) promoter, 

2 0 inducible promoters such as the MMT promoter, the 

metallothionein promoter, heat shock promoters, the albumin 
promoter, the ApoAI promoter, human globin promoters, viral 
thymidine kinase promoters such as the Herpes Simplex 
thymidine kinase promoter, retroviral LTRs, the beta-actin 
25 promoter, and human growth hormone promoters. The promoter 
also may be the native promoter which controls the gene 
encoding the polypeptide. 

The nucleic acid sequence encoding the polypeptide of 
the present invention will be placed under the control of a 

3 0 suitable promoter. 

In one embodiment, the retroviral plasmid vector is 
employed to transduce packaging cell lines to form producer 
cell lines. Examples of packaging cells which may be 
transfected include, but are not limited to, the PE501, 
35 PA317, Y-2, Y-AM, PA12 , T19-14X, VT-19-17-H2, YCRE , YCRIP, 
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GP+E-86, GP+envAml2, and DAN cell lines as described in 
Miller, A., Human Gene Therapy 1: 5-14 (1990). The vector 
may be transduced into the packaging cells through any 
means known in the art. Such means include, but are not 
5 limited to, electroporation, the use of liposomes, and CaP0 4 
precipitation. Alternatively, the retroviral plasmid 
vector may be encapsulated into a liposome, or coupled to a 
lipid, and then administered to a host. The producer cell 
line will generate infectious retroviral vector particles 

10 which are inclusive of the nucleic acid sequence (s) 
encoding the polypeptides. Such retroviral vector 
particles then may be employed to transduce eukaryotic 
cells, either in vitro or in vivo. The transduced 
eukaryotic cells will express the nucleic acid sequence (s) 

15 encoding the polypeptide. Eukaryotic cells which may be 
transduced include, but are not limited to, embryonic stem 
cells, embryonic carcinoma cells, as well as hematopoietic 
stem cells, hepatocytes, fibroblasts, myoblasts, 
keratinocytes, endothelial cells, and bronchial epithelial 

20 cells. 

An exemplary method of gene therapy involves 
transplantation of fibroblasts which are capable of 
expressing a LSG polypeptide or an agonist or antagonist 
thereof onto a patient. Generally fibroblasts are obtained 

2 5 from a subject by skin biopsy. The resulting tissue is 

placed in tissue-culture medium and separated into small 
pieces. Small chunks of the tissue are placed on a wet 
surface of a tissue culture flask, approximately ten pieces 
are placed in each flask. The flask is turned upside down, 
30 closed tight and left at room temperature over night. 

After 24 hours at room temperature, the flask is inverted 
and the chunks of tissue remain fixed to the bottom of the 
flask and fresh media (e. g., Ham's F12 media, with 10% 
FBS, penicillin and streptomycin) is added. The flasks are 

3 5 then incubated at 3 7°C for approximately one week. At this 
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time, fresh media is added and subsequently changed every 
several days. After an additional two weeks in culture, a 
monolayer of fibroblasts emerge. The monolayer is 
trypsinized and scaled into larger flasks. pMV-7 
5 (Kirschmeier, P. T. et al . , DNA, 7: 219-25 (1988)), flanked 
by the long terminal repeats of the Moloney murine sarcoma 
virus, is digested with EcoRl and Hindlll and subsequently 
treated with calf intestinal phosphatase. The linear 
vector is fractionated on agarose gel and purified, using 
10 glass beads. The cDNA encoding a LSG polypeptide of the 
present invention or an agonist or antagonist thereof can 
be amplified using PCR primers which correspond to their 5' 
and 3' end sequences respectively. Preferably, the 5' 
primer contains an EcoRI site and the 3' primer includes a 
15 Hindlll site. Equal quantities of the Moloney murine 

sarcoma virus linear backbone and the amplified EcoRI and 
Hindlll fragment are added together in the presence of T4 
DNA ligase. The resulting mixture is maintained under 
conditions appropriate for ligation of the two fragments. 
2 0 The ligation mixture is then used to transform bacteria HB 
101, which are then plated onto agar containing kanamycin 
for the purpose of confirming that the vector has the gene 
of interest properly inserted. Amphotropic pA317 or 
GP+aml2 packaging cells are grown in tissue culture to 
25 confluent density in Dulbecco ' s Modified Eagles Medium 
(DMEM) with 10% calf serum (CS) , penicillin and 
streptomycin. The MSV vector containing the gene is then 
added to the media and the packaging cells transduced with 
the vector. The packaging cells now produce infectious 
30 viral particles containing the gene (the packaging cells 
are now referred to as producer cells) . Fresh media is 
added to the transduced producer cells, and subsequently, 
the media is harvested from a 10 cm plate of confluent 
producer cells. The spent media, containing the infectious 
35 viral particles, is filtered through a millipore filter to 
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remove detached producer cells and this media is then used 
to infect fibroblast cells. Media is removed from a sub- 
confluent plate of fibroblasts and quickly replaced with 
the media from the producer cells. This media is removed 
5 and replaced with fresh media. If the titer of virus is 
high, then virtually all fibroblasts will be infected and 
no selection is required. If the titer is very low, then 
it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the 
10 fibroblasts have been efficiently infected, the fibroblasts 
are analyzed to determine whether protein is produced. The 
engineered fibroblasts are then transplanted onto the host, 
either alone or after having been grown to confluence on 
cytodex 3 microcarrier beads. 
15 Alternatively, in vivo gene therapy methods can be 

used to treat LSG related disorders, diseases and 
conditions. Gene therapy methods relate to the 
introduction of naked nucleic acid (DNA, RNA, and antisense 
DNA or RNA) sequences into an animal to increase or 
20 decrease the expression of the polypeptide. 

For example, a LSG polynucleotide of the present 
invention or a nucleic acid sequence encoding an agonist or 
antagonist thereto may be operatively linked to a promoter 
or any other genetic elements necessary for the expression 
25 of the polypeptide by the target tissue. Such gene therapy 
and delivery techniques and methods are known in the art, 
see, for example, WO 90/11092, WO 98/11779; U.S. Patents 
5,693,622, 5,705,151, and 5,580,859; Tabata H. et al . 
(1997) Cardiovasc. Res. 35 (3): 470-479, Chao J et al . 
30 (1997) Pharmacol. Res. 35 (6): 517-522, Wolff J. A. (1997) 
Neuromuscul. Disord. 7 (5): 314-318, Schwartz B. et al . 
(1996) Gene Ther. 3 (5): 405-411, Tsurumi Y. et al . (1996) 
Circulation 94 (12) : 3281-3290 (incorporated herein by 
reference) . The polynucleotide constructs may be delivered 
35 by any method that delivers injectable materials to the 
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cells of an animal, such as, injection into the 
interstitial space of tissues (heart, muscle, skin, lung, 
liver, intestine and the like) . The polynucleotide 
constructs can be delivered in a pharmaceutically 
5 acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers 
to sequences that are free from any delivery vehicle that 
acts to assist, promote, or facilitate entry into the cell, 
including viral sequences, viral particles, liposome 
10 formulations, lipofectin or precipitating agents and the 
like. However, polynucleotides may also be delivered in 
liposome formulations (such as those taught in Feigner P. 
0 L. et al. (1995) Ann. NY Acad. Sci . 772: 126-139 and 

| Abdallah B. et al . (1995) Biol. Cell 85 (1): 1-7) which can 

3 15 be prepared by methods well known to those skilled in the 

art . 

The polynucleotide vector constructs used in the gene 
therapy method are preferably constructs that will not 
integrate into the host genome nor will they contain 
20 sequences that allow for replication. Any strong promoter 
known to those skilled in the art can be used for driving 
the expression of DNA . Unlike other gene therapies 
techniques, one major advantage of introducing naked 
nucleic acid sequences into target cells is the transitory 
25 nature of the polynucleotide synthesis in the cells. 

Studies have shown that non-replicating DNA sequences can 
be introduced into cells to provide production of the 
desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the 
30 interstitial space of tissues within the an animal, 

including of muscle, skin, brain, lung, liver, spleen, bone 
marrow, thymus, heart, lymph, blood, bone, cartilage, 
pancreas, kidney, gall bladder, stomach, intestine, testis, 
ovary, uterus, rectum, nervous system, eye, gland, and 
35 connective tissue. Interstitial space of the tissues 
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comprises the intercellular fluid, mucopolysaccharide 
matrix among the reticular fibers of organ tissues, elastic 
fibers in the walls of vessels or chambers, collagen fibers 
of fibrous tissues, or that same matrix within connective 
5 tissue ensheathing muscle cells or in the lacunae of bone. 
It is similarly the space occupied by the plasma of the 
circulation and the lymph fluid of the lymphatic channels. 
Delivery to the interstitial space of muscle tissue is 
preferred. The polynucleotide construct may be 
10 conveniently delivered by injection into the tissues 

comprising these cells. They are preferably delivered to 
and expressed in persistent, non-dividing cells which are 
differentiated, although delivery and expression may be 
achieved in non-differentiated or less completely 
15 differentiated cells, such as, for example, stem cells of 
blood or skin fibroblasts. In vivo muscle cells are 
particularly competent in their ability to take up and 
express polynucleotides . 

For the naked polynucleotide injection, an effective 
20 dosage amount of DNA or RNA will be in the range of from 

about 0.05 M9/kg body weight to about 50 mg/kg body weight. 
Preferably the dosage will be from about 0.005 mg/kg to 
about 20 mg/kg and more preferably from about 0.05 mg/kg to 
about 5 mg/kg. Of course, as the artisan of ordinary skill 
25 will appreciate, this dosage will vary according to the 
tissue site of injection. The appropriate and effective 
dosage of nucleic acid sequence can readily be determined 
by those of ordinary skill in the art and may depend on the 
condition being treated and the route of administration. 
3 0 The preferred route of administration is by the parenteral 
route of injection into the interstitial space of tissues. 
However, other parenteral routes may also be used, such as, 
inhalation of an aerosol formulation particularly for 
delivery to lungs or bronchial tissues, throat or mucous 
35 membranes of the nose. In addition, naked polynucleotide 
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constructs can be delivered to arteries during angioplasty 
by the catheter used in the procedure. 

The dose response effects of injected polynucleotide 
in muscle in vivo is determined as follows. Suitable 
5 template DNA for production of mRNA coding for polypeptide 
of the present invention is prepared in accordance with a 
standard recombinant DNA methodology. The template DNA, 
which may be either circular or linear, is either used as 
naked DNA or complexed with liposomes. The quadriceps 
10 muscles of mice are then injected with various amounts of 
the template DNA. 

Five to six week old female and male Balb/C mice are 
anesthetized by intraperitoneal injection with 0.3 ml of 
2.5% Avertin. A 1.5 cm incision is made on the anterior 
15 thigh, and the quadriceps muscle is directly visualized. 

The template DNA is injected in 0.1 ml of carrier in a 1 cc 
syringe through a 27 gauge needle over one minute, 
approximately 0 . 5 cm from the distal insertion site of the 
muscle into the knee and about 0 . 2 cm deep. A suture is 
20 placed over the injection site for future localization, and 
the skin is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) 
muscle extracts are prepared by excising the entire 
quadriceps. Every fifth 15 fim cross-section of the 
25 individual quadriceps muscles is histochemically stained 
for protein expression. A time course for protein 
expression may be done in a similar fashion except that 
quadriceps from different mice are harvested at different 
times. Persistence of DNA in muscle following injection 
3 0 may be determined by Southern blot analysis after preparing 
total cellular DNA and HIRT supernatants from injected and 
control mice. 

The results of the above experimentation in mice can 
be use to extrapolate proper dosages and other treatment 
3 5 parameters in humans and other animals using naked DNA. 
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Nonhuman Transgenic Animals 

The LSG polypeptides of the invention can also be 
expressed in nonhuman transgenic animals. Nonhuman animals 
of any species, including, but not limited to, mice, rats, 
5 rabbits, hamsters, guinea pigs, pigs, micro-pigs, goats, 
sheep, cows and non-human primates, e. g., baboons, 
monkeys, and chimpanzees, may be used to generate 
transgenic animals. Any technique known in the art may be 
used to introduce the transgene (I. e., polynucleotides of 
10 the invention) into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not 
limited to, pronuclear microinjection (Paterson et al . , 
Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et 
al., Biotechnology (NY) 11: 1263-1270 (1993); Wright et 
15 al., Biotechnology (NY) 9: 830-834 (1991); andHoppe et 
al., U.S. Patent 4,873,191); retrovirus mediated gene 
transfer into germ lines (Van der Putten et al . , Proc . 
Natl. Acad. Sci., USA 82: 6148-6152 (1985)), blastocysts or 
embryos; gene targeting in embryonic stem cells (Thompson 
20 et al., Cell 56: 313-321 (1989)); electroporation of cells 
or embryos (Lo, 1983, Mol . Cell. Biol. 3: 1803-1814 
(1983)); introduction of the polynucleotides of the 
invention using a gene gun (see, e.g., Ulmer et al . , 
Science 259: 1745 (1993); introducing nucleic acid 
25 constructs into embryonic pluripotent stem cells and 

transferring the stem cells back into the blastocyst; and 
sperm mediated gene transfer (Lavitrano et al . , Cell 57: 
717-723 (1989) ) . For a review of such techniques, see 
Gordon, "Transgenic Animals, " Intl. Rev. Cytol . 115: 171-229 
30 (1989) , which is incorporated by reference herein in its 
entirety . 

Any technique known in the art may be used to produce 
transgenic clones containing polynucleotides of the 
invention, for example, nuclear transfer into enucleated 
3 5 oocytes of nuclei from cultured embryonic, fetal, or adult 
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cells induced to quiescence (Campell et al . , Nature 380: 
64-66 (1996); Wilmut et al . , Nature 385: 810813 (1997)). 

The present invention provides for transgenic animals 
that carry the transgene in all their cells, as well as 
5 animals which carry the transgene in some, but not all 
their cells, i.e., mosaic or chimeric animals. The 
transgene may be integrated as a single transgene or as 
multiple copies such as in concatamers, e. g., head-to-head 
tandems or head-to-tail tandems. The transgene may also be 
10 selectively introduced into and activated in a particular 
cell type by following, for example, the teaching of Lasko 
et al. (Lasko et al . , Proc . Natl. Acad. Sci . USA 89: 6232- 
6236 (1992)). The regulatory sequences required for such a 
cell-type specific activation will depend upon the 
15 particular cell type of interest, and will be apparent to 
those of skill in the art. When it is desired that the 
polynucleotide transgene be integrated into the chromosomal 
site of the endogenous gene, gene targeting is preferred. 
Briefly, when such a technique is to be utilized, vectors 
20 containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of 
integrating, via homologous recombination with chromosomal 
sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene. The transgene 
25 may also be selectively introduced into a particular cell 
type, thus inactivating the endogenous gene in only that 
cell type, by following, for example, the teaching of Gu et 
al. (Science 265: 103-106 (1994)). The regulatory 
sequences required for such a cell-type specific 
30 inactivation will depend upon the particular cell type of 
interest, and will be apparent to those of skill in the 
art . 

Once transgenic animals have been generated, the 
expression of the recombinant gene may be assayed utilizing 
35 standard techniques. Initial screening may be accomplished 
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by Southern blot analysis or PCR techniques to analyze 
animal tissues to verify that integration of the transgene 
has taken place. The level of mRNA expression of the 
transgene in the tissues of the transgenic animals may also 
5 be assessed using techniques which include, but are not 
limited to, Northern blot analysis of tissue samples 
obtained from the animal, in situ hybridization analysis, 
and reverse transcriptase-PCR (rt-PCR) . Samples of 
transgenic gene -expressing tissue may also be evaluated 
10 immunocytochemically or immunohistochemically using 
antibodies specific for the transgene product. 

Once the founder animals are produced, they may be 
bred, inbred, outbred, or crossbred to produce colonies of 
the particular animal. Examples of such breeding 
15 strategies include, but are not limited to: outbreeding of 
founder animals with more than one integration site in 
order to establish separate lines; inbreeding of separate 
lines in order to produce compound transgenics that express 
the transgene at higher levels because of the effects of 
2 0 additive expression of each transgene; crossing of 
heterozygous transgenic animals to produce animals 
homozygous for a given integration site in order to both 
augment expression and eliminate the need for screening of 
animals by DNA analysis; crossing of separate homozygous 
25 lines to produce compound heterozygous or homozygous lines; 
and breeding to place the transgene on a distinct 
background that is appropriate for an experimental model of 
interest . 

Transgenic animals of the invention have uses which 
30 include, but are not limited to, animal model systems 
useful in elaborating the biological function of LSG 
polypeptides of the present invention, studying conditions 
and/or disorders associated with aberrant expression of 
LSGs, and in screening for compounds effective in 
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ameliorating such LSG associated conditions and/or 
disorders . 
Knock-Out Animals 

Endogenous gene expression can also be reduced by 
5 inactivating or "knocking out" the gene and/or its promoter 
using targeted homologous recombination (e. g., see 
Smithies et al . , Nature 317: 230-234 (1985); Thomas & 
Capecchi, Cell 51: 503512 (1987); Thompson et al . , Cell 5: 
313-321 (1989) ; each of which is incorporated by reference 
10 herein in its entirety) . For example, a mutant, non- 
functional LSG polynucleotide of the invention (or a 
n completely unrelated DNA sequence) flanked by DNA 

5 homologous to the endogenous LSG polynucleotide sequence 

g (either the coding regions or regulatory regions of the 

H? 15 gene) can be used, with or without a selectable marker 

m and/or a negative selectable marker, to transfect cells 

^ that express polypeptides of the invention in vivo. In 

O another embodiment, techniques known in the art are used to 

y generate knockouts in cells that contain, but do not 

3 20 express the gene of interest. Insertion of the DNA 

O construct, via targeted homologous recombination, results 

in inactivation of the targeted gene. Such approaches are 
particularly suited in research and agricultural fields 
where modifications to embryonic stem cells can be used to 
25 generate animal offspring with an inactive targeted gene 
(e. g., see Thomas & Capecchi 1987 and Thompson 1989, 
supra) . This approach can also be routinely adapted for 
use in humans provided the recombinant DNA constructs are 
directly administered or targeted to the required site in 
3 0 vivo using appropriate viral vectors that will be apparent 
to those of skill in the art. 

In further embodiments of the invention, cells that 
are genetically engineered to express the LSG polypeptides 
of the invention, or alternatively, that are genetically 
3 5 engineered not to express the LSG polypeptides of the 
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invention (e. g. , knockouts) are administered to a patient 
in vivo. Such cells may be obtained from the patient or a 
MHC compatible donor and can include, but are not limited 
to, fibroblasts, bone marrow cells, blood cells (e. g., 
lymphocytes) , adipocytes, muscle cells, and endothelial 
cells. The cells are genetically engineered in vitro using 
recombinant DNA techniques to introduce the coding sequence 
of polypeptides of the invention into the cells, or 
alternatively, to disrupt the coding sequence and/or 
endogenous regulatory sequence associated with the 
polypeptides of the invention, e. g., by transduction 
(using viral vectors, and preferably vectors that integrate 
the transgene into the cell genome) or transfection 
procedures, including, but not limited to, the use of 
plasmids, cosmids, YACs, naked DNA, electroporat ion, 

liposomes, etc. 

The coding sequence of the LSG polypeptides of the 
invention can be placed under the control of a strong 
constitutive or inducible promoter or promoter/enhancer to 
achieve expression, and preferably secretion, of the LSG 
polypeptides of the invention. The engineered cells which 
express and preferably secrete the LSG polypeptides of the 
invention can be introduced into the patient systemically , 
e.g., in the circulation, or intraperitoneally . 

Alternatively, the cells can be incorporated into a 
matrix and implanted in the body, e.g., genetically 
engineered fibroblasts can be implanted as part of a skin 
graft or genetically engineered endothelial cells can be 
implanted as part of a lymphatic or vascular graft (see, 
for example, U.S. Patent 5,3 99,34 9 and U.S. Patent 
5,460,959 each of which is incorporated by reference herein 

in its entirety) . 

When the cells to be administered are non- autologous 
or non-MHC compatible cells, they can be administered using 
well known techniques which prevent the development of a 
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host immune response against the introduced cells. For 
example, the cells may be introduced in an encapsulated 
form which, while allowing for an exchange of components 
with the immediate extracellular environment, does not 
5 allow the introduced cells to be recognized by the host 
immune system. 

Transgenic and "knock-out" animals of the invention 
have uses which include, but are not limited to, animal 
model systems useful in elaborating the biological function 
10 of LSG polypeptides of the present invention, studying 
conditions and/or disorders associated with aberrant LSG 
expression, and in screening for compounds effective in 
ameliorating such LSG associated conditions and/or 
disorders . 

15 The following nonlimiting example is provided to 

further illustrate the present invention. 



EXAMPLE 

The following Example is carried out using standard 
techniques, which are well known and routine to those of 
2 0 skill in the art, except where otherwise described in 
detail. Routine molecular biology techniques of the 
following example can be carried out as described in 
standard laboratory manuals, such as Sambrook et al . , 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.; Cold 

2 5 Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 

(1989) . 

Introduction and background for Microarray analysis 

cDNA microarrays are prepared by high-speed robotic 
printing of thousands of distinct cDNAs in an ordered array 

3 0 on glass microscope slides. They are used to measure the 

relative abundance of specific sequences in two complex 
samples (Schena et al, 1995; Shalon et al, 1996). 
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In the microarray procedure, mRNA is isolated from 
tissues of interest, either from a tumor or control (normal 
or normal adjacent tissue) . mRNA (200-600 ng) from cancer 
tissue or control is reverse transcribed to incorporate the 
5 fluorescent nucleotides Cy5 (red) or Cy3 (green) , 

respectively. The two populations of f luorescently labeled 
cDNA are mixed together and hybridized simultaneously to a 
microarray bearing approximately 10,000 cDNA elements in a 
2cm x 2cm area on a glass slide (Microarrays hybridization 
10 service: Incyte Genomics, Fremont, CA, USA). After 

hybridization, the slides are scanned with a scanning laser 
confocal microscope. 

The scanned image is used to generate the intensity 
and local background measurements for each spot on the 
15 array (GEMtools software, Incyte Genomics) . For each spot, 
representing one EST, the ratio of the normalized Cy5/Cy3 
intensities generates a quantitation of the gene's 
expression in one tissue relative to the control, in this 
case, the expression in cancer tissue versus either normal 
20 or normal adjacent tissue. For example, a gene that shows 
a Cancer-Cy5 intensity of 3000 and a Normal-Cy3 intensity 
of 1000 is expressed 3 -fold more in cancer tissue. 
Advanced analysis software is used to sort and decipher 
patterns of gene expression from the data (Cluster and 
25 Treeview programs, Stanford University; Eisen et al , 1998; 
Alizadeh et al, 2000) . However, the reproducibility study 
from Incyte shows that the level of detectable differential 
expression is calculated to be approximately plus or minus 
1.74. Consequently, any elements with observed ratios 
3 0 greater than or equal to 1 . 8 between cancer and normal are 
deemed differentially expressed. 
References : 

1. Schena, M . , D. Shalon, R.W. Davis, and P.O. Brown. 
1995. Quantitative monitoring of gene expression patterns 
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with a complementary cDNA microarray. Science 270: 
467-470 . 

2. Shalon, D., S.J. Smith, and P.O. Brown. 1996. A DNA 
Microarray System for Analyzing Complex DNA samples Using 

5 Two-color Fluorescent Probe Hybridization. Genome Research 
6: 639-645. 

3. Eisen, M.B., P.T. Spellman, P.O. Brown, and D . Botstein. 
1998. "Cluster analysis and display of genome-wide 
expression patterns" . PNAS 95: 14863-14868. 

10 4. Alizadeh, A. A. , et al, 2000. "Distinct types of diffuse 
large B-cell lymphoma identified by gene expression 
profiling." Nature, 403: 503-511. 

5. GEM Microarray Reproducibility Study. Technical 
specifications from Incyte Genomics. 

15 Lung diaDexus microarray candidates 

Following is a list of "diaDexus microarray 
candidates" sequences for lung cancer, also referred to 
herein as lung specific genes or LSGs : 

Sequences Gene ID/ Clone ID/ ddxid 

1 1040286/ 2746236/ 18867 

2 198406/ 2639142/ 12801 

3 441298/ 1877647/ 8255 

4 244318/ 3032060/ 7048 

5 429368/ 2890670/ 4002 

6 975386/ 289582/ 5018 

7 480710/ 1911471/ 12153 

8 1040699/ 1899557/ 13678 

9 1040383/ 1556335/ 3273 

10 108494/ 3130429/ 3126 

11 331878/ 2445607/ 3070 

12 233442/ 1959959/ 18837 

13 255993/ 1670828/ 7873 

14 897843 / 1823610/ 16315 

15 414885/ 2655867/ 21009 
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16 


1100375/ 690306/ x 


17 


6133/ 3993331/ x 




18 


257782/ 3032060A/ 


7048A 


19 


347005/ 1911471A/ 


12153A 


20 


332710/ 3130429A/ 


3126A 


21 


255828/ 2445607A/ 


3070A 


22 


328565/ 3993331A/ 


X 



Table 1 depicts numbers which are ratios indicating the 
levels of expression of the Clone IDs in the cancer tissue 
10 sample (labeled with Cy5) relative to the normal tissue, or 
the normal adjacent tissue control (labeled with Cy3 ) used 
% in that experiment. The Cy5/Cy3 ratio of the normalized 

^0 fluorescent intensities in each channel is used as a 

J3 measure of relative gene expression. A positive number 

m 15 represents overexpression in cancer relative to the normal 

control. A negative number represents higher expression in 
the normal adjacent sample compared to the cancer tissue 
sample used in that experiment. X means no experiment was 
performed for the particular tissue sample. 
2 0 Table 1: 

ClonelD LN.A143 LN.A160 LN.A182 

Vs. Apool Vs. Apool Vs. Apool 

2746236 5.3 2.6 1-5 

2639142 1.9 1.7 x 

25 1877647 1.6 1.5 x 

3032060 2.1 1-2 x 

2890670 2.6 1.0 2.7 

289582 1.5 1.1 x 



30 



ClonelD LN . A2 13 LN . A2 8 8 LN . A3 2 3 

Vs. Apool Vs. Apool Vs. Apool 

2746236 4.6 8.9 3.7 



2639142 2.9 x 4.5 

1877647 2 x 

3032060 2.3 x 



35 2890670 x 

289582 2.4 x 

ClonelD LN . A3 3 9 LN . A3 4 5 



5 . 0 
4 . 8 

4.8 4.6 

1 . 8 
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Vs. Apool Vs. Apool 

2746236 2.9 2.9 

2639142 1.7 1.2 

1877647 1.9 1-2 

5 3032060 1.2 1.2 

2890670 2.4 1.9 

289582 2.4 1-9 

Absolute values greater than or equal to 1 . 8 are considered 
to be above background levels, and are, therefore 

10 significant (Source: Incyte Genomics: GEM microarray 
technical specifications) . The relative levels of 
expression in Table 1 show that Clone ID 2746236 mRNA 
expression is higher than background in 7 of the cancer 
tissue samples out of a total of 8 experiments. Clone ID 

15 2639142 mRNA expression is higher than background in 3 of 
the cancer tissue samples out of a total of 6 experiments. 
Clone ID 1877647 mRNA expression is higher than background 
in 3 of the cancer tissue samples out of a total of 6 
experiments. Clone ID 3032060 mRNA expression is higher 

2 0 than background in 3 of the cancer tissue samples out of a 

total of 6 experiments. Clone ID 2890670 mRNA expression 
is higher than background in 5 of the cancer tissue samples 
out of a total of 6 experiments. Clone ID 289582 mRNA 
expression is higher than background in 4 of the cancer 
25 tissue samples out of a total of 6 experiments. 

An additional 16 clones have also been identified by 
the same type of experiments. These additional clones all 
show from 30% to 80 % overexpression in cancer tissue 
samples. The sequences of these LSGs are also disclosed 

3 0 herein. 

Semi-quantitative Polymerase Chain Reaction 

Semi-quantitative Polymerase Chain Reaction (SQ-PCR) 
is a method that utilizes end point PCR on serial dilutions 
of cDNA samples in order to determine relative expression 
35 patterns of genes of interest in multiple samples. Using 
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random hexamer primed Reverse Transcription (RT) cDNA 
panels are created from total RNA samples. Gene specific 
primers are then used to amplify fragments using Polymerase 
Chain Reaction (PCR) technology from four lOx serial cDNA 
5 dilutions in duplicate. Relative expression levels of 0, 
1, 10, 100 and 1000 are used to evaluate gene expression. 
A positive reaction in the most dilute sample indicates the 
highest relative expression value. This is determined by 
analysis of the sample reactions on a 2-4% agarose gel. 

10 The tissue samples used include 12 normal, 12 cancer and 6 
pairs tissue specific cancer and matching samples. 

Of the list of "diaDexus microarray candidates" 
sequences for lung cancer, the following sequences were 
analyzed by semi -quantitative PCR and found to be 

15 upregulated in lung adenocarc inoma/ carcinoma . 

Example# SEQ ID NO: Gene ID Clone ID ddxid SQlng 

code 

1 i 1040286 2746236 18867 Sqlng042 

2 3 441298 1877647 8255 Sqlng040 
20 3 ii 331878 2445607 3070 Sqlng046 

4 22 328565 3993331A x Sqlng050 



Example 1 - SEQ ID NO : 1 

Semi quantitative PCR was done using the following 
primers : 
25 Sqlng042 forward: 

5' CCAGAGCCCAAATCTTGTGAC 3' (SEQ ID NO: 23) 
Sqclng042 reverse: 

5- GCGGCTTTGTCTTGGCATTA 3' (SEQ ID NO: 24) 

Table 2 shows absolute numbers which are relative levels of 
3 0 expression of Sqlng042 in 12 normal samples from 12 
different tissues. These RNA samples are individual 
samples or are commercially available pools, originated by 
pooling samples of a particular tissue from different 
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individuals. Using Polymerase Chain Reaction (PCR) 
technology expression levels were analyzed from four lOx 
serial cDNA dilutions in duplicate. Relative expression 
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 
5 expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 
Table 2: 





Tissue 


Normal 




Breast 


1000 


10 


Colon 


1000 




Endometruim 


1000 




Kidney 


1000 




Liver 


10 




Lung 


1000 


15 


Ovary 


1000 




Prostate 


100 




Small Intestine 


1000 




Stomach 


1000 




Testis 


1000 


20 


Uterus 


100 



Relative levels of expression in Table 2 show that normal 
breast, colon, endometrium, kidney, lung, ovary, small 
intestine, stomach and testis show high expression of 
Sqlng042. Moderate levels of expression are apparent in 

25 prostate and uterus. Low levels of expression are apparent 
in normal liver. 

Table 3 shows absolute numbers which are relative 
levels of expression of Sqlng042 in 12 cancer samples from 
12 different tissues. Using Polymerase Chain Reaction 

30 (PCR) technology expression levels were analyzed from four 
lOx serial cDNA dilutions in duplicate. Relative 
expression levels of 0, 1, 10, 100 and 1000 are used to 
evaluate gene expression. A positive reaction in the most 
dilute sample indicates the highest relative expression 

3 5 value. 
Table 3 : 

Tissue Cancer 

bladder 1000 
breast 1000 
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colon 


1000 


kidney 


1 


liver 


100 


lung 


1000 


ovary- 


1 


pancreas 


1000 


prostate 


10 


stomach 


1000 


testes 


1 


uterus 


1000 



10 

Relative levels of expression in Table 3 show that Sqlng042 
is expressed in low levels in kidney, ovary, and testis 
carcinomas. Sqlng042 is expressed in high levels in other 
tissue carcinomas. 
15 Table 4 shows absolute numbers which are relative 

levels of expression of Sqlng042 in 6 lung cancer matching 
samples. A matching pair is formed by mRNA from the cancer 
sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same 

2 0 individual. 

Using Polymerase Chain Reaction (PCR) technology 
expression levels were analyzed from four lOx serial cDNA 
dilutions in duplicate. Relative expression levels of 0, 
1, 10, 100 and 1000 are used to evaluate gene expression. 

25 A positive reaction in the most dilute sample indicates the 
highest relative expression value 
Table 4 : 

Sample ID Tissue Cancer NAT 

9702C115RB lung 1 1 

30 9502C032 lung 1000 1000 

8894A lung 1 10°° 

9704C060RA lung 1 1 

11145B lung 1 1000 

9502C109R lung 1000 1000 

35 Relative levels of expression in Table 4 show that Sqlng042 
is expressed in high levels in two of the six lung cancer 
samples. However, high levels of expression was observed 
in the matching normal adjacent tissue (NAT) . 
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Example 2 - SEQ ID NO : 3 

Semi quantitative PCR was done using the following 

primers : 

Sqlng04 0 forward : 
5 5' ATTGCCATCCCAGTGACAGTG 3' (SEQ ID NO: 25) 
Sqclng040 reverse : 

5' TTGGGAGATGTGGGTGATGAG 3' (SEQ ID NO: 26) 

Table 5 shows absolute numbers which are relative levels of 
expression of Sqlng040 in 12 normal samples from 12 

10 different tissues. These RNA samples are individual 

samples or are commercially available pools, originated by 
pooling samples of a particular tissue from different 
individuals. Using Polymerase Chain Reaction (PCR) 
technology expression levels were analyzed from four lOx 

15 serial cDNA dilutions in duplicate. Relative expression 
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 
expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 
Table 5: 

20 



25 



30 



Tissue 


Normal 


Breast 


0 


Colon 


0 


Endometrium 


1 


Kidney 


0 


Liver 


0 


Lung 


10 


Ovary 


1 


Prostate 


10 


Small Intestine 


1 


Stomach 


1 


Testis 


100 


Uterus 


1 



Relative levels of expression in Table 5 show that normal 
lung and prostate show moderate expression of Sqlng040. 
35 High level expression is only apparent in testis. Low 
levels of expression are apparent in endometrium, ovary, 
small intestine and uterus. 
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Table 6 shows absolute numbers which are relative 
levels of expression of Sqlng040 in 12 cancer samples from 
12 different tissues. Using Polymerase Chain Reaction 
(PCR) technology expression levels were analyzed from four 
5 lOx serial cDNA dilutions in duplicate. Relative 

expression levels of 0, 1, 10, 100 and 1000 are used to 
evaluate gene expression. A positive reaction in the most 
dilute sample indicates the highest relative expression 
value . 
10 Table 6: 







Tissue 


Cancer 






bladder 


0 






breast 


10 






colon 


0 




15 


kidney 


10 






liver 
lung 
ovary 
pancreas 


0 
100 
100 
100 




20 


prostate 


10 






stomach 


10 






testes 


10 






uterus 


10 



Relative levels of expression in Table 6 show that Sqlng04 0 
25 is expressed in moderate to high levels in breast, kidney, 
lung, ovary, pancreas, prostate, stomach, testis and uterus 
carcinomas . 

Table 7 shows absolute numbers which are relative 
levels of expression of Sqlng040 in 6 lung cancer matching 

30 samples. A matching pair is formed by mRNA from the cancer 
sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same 
individual. Using Polymerase Chain Reaction (PCR) 
technology expression levels were analyzed from four lOx 

35 serial cDNA dilutions in duplicate. Relative expression 
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 




DEX-0214 - 102 - PATENT 

expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 
Table 7 : 

Sample ID Tissue Cancer NAT 

5 9702C115RB lung 100 10 

95 02C03 2 lung 10 0 1 

8894A lung 10 0 

97 04C0 6 0RA lung 10 10 

11145B lung 10 100 

10 95 02C10 9R lung 100 10 

Relative levels of expression in Table 7 show that Sqlng040 
is expressed in moderate levels in four of the six lung 
cancer samples compared with the expression in the matching 
normal adjacent tissue (NAT) . 

15 Example 3 - SEQ ID NO: 11 

Semi quantitative PCR was done using the following 

primers : 

Sqlng04 6 forward: 

5' CCTGCCCTGGTATGTTTTTCTT 3' (SEQ ID NO: 27) 

20 Sqlng046 reverse: 

5' CAGCCCACAAATGCCTTCTAC 3' (SEQ ID NO: 28) 

Table 8 shows absolute numbers which are relative 
levels of expression of Sqlng046 in 12 normal samples from 
12 different tissues. These RNA samples are individual 

25 samples or are commercially available pools, originated by 
pooling samples of a particular tissue from different 
individuals. Using Polymerase Chain Reaction (PCR) 
technology expression levels were analyzed from four lOx 
serial cDNA dilutions in duplicate. Relative expression 

30 levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 
expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 
Table 8 : 

Tissue Normal 

35 Breast 0 
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- 103 


Colon 


10 


Endometrium 


1 


Kidney 


10 


Liver 


1 


Lung 


10 


Ovary 


10 


Prostate 


0 


Small Intestine 


0 


Stomach 


0 


Testis 


10 


Uterus 


1 



10 



Relative levels of expression in Table 8 show that normal 
colon, kidney, lung, and ovary show moderate expression of 
Sqlng046. Low levels of expression are apparent in 
15 endometrium and liver. No expression is apparent in other 
tissues . 

Table 9 shows absolute numbers which are relative 
levels of expression of Sqlng046 in 12 cancer samples from 
12 different tissues. Using Polymerase Chain Reaction 

20 (PCR) technology expression levels were analyzed from four 
lOx serial cDNA dilutions in duplicate. Relative 
expression levels of 0, 1, 10, 100 and 1000 are used to 
evaluate gene expression. A positive reaction in the most 
dilute sample indicates the highest relative expression 

2 5 value. 
Table 9 : 



30 



35 



Tissue 


Cancer 


bladder 


1 


breast 


1 


colon 


0 


kidney 


1 


liver 


1 


lung 


0 


ovary 


0 


pancreas 


10 


prostate 


0 


stomach 


1 


testes 


1 


uterus 


1 



40 Relative levels of expression in Table 9 show that Sqlng046 
is expressed in low levels in bladder, breast, kidney, 
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liver, stomach, testis and uterus carcinomas. Sqlng046 is 
expressed in moderate levels only in pancreatic carcinoma. 

Table 10 shows absolute numbers which are relative 
levels of expression of Sqlng046 in 6 lung cancer matching 
5 samples. A matching pair is formed by mRNA from the cancer 
sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same 
individual. Using Polymerase Chain Reaction (PCR) 
technology expression levels were analyzed from four lOx 
10 serial cDNA dilutions in duplicate. Relative expression 
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 
expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 
Table 10: 

15 Sample ID Tissue Cancer NAT 

9702C115RB lung 10 10 

9502C032 lung 100 100 

8 8 94A lung 10 1 

9704C060RA lung 10 10 

20 11145B lung 1 10 

9502C109R lung 100 1 

Relative levels of expression in Table 10 show that 
Sqlng046 is expressed in higher levels in two of the six 
lung cancer samples compared with the expression in 
25 matching normal adjacent tissue (NAT) . 



Example 4 - SEQ ID NO: 22 

Semi quantitative PCR was done using the following 

primers : 

Sqlng050 forward : 
3 0 5' C C A C T AGG AT TAT T T C C AG C AT AA 3' (SEQ ID NO: 29) 
Sqclng050 reverse : 

5' GGTGTGAAAATATCTGGTCCACTT 3' (SEQ ID NO : 3 0 ) 

Table 12 shows absolute numbers which are relative 
levels of expression of SqlngOBO in 12 normal samples from 




100 
1000 
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12 different tissues. These RNA samples are individual 
samples or are commercially available pools, originated by 
pooling samples of a particular tissue from different 
individuals. Using Polymerase Chain Reaction (PCR) 
5 technology expression levels were analyzed from four lOx 
serial cDNA dilutions in duplicate. Relative expression 
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 
expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 
10 Table 12: 

Tissue Normal 

Breast 100 

Colon 1000 

Endometrium 10 0 

15 Kidney 100 

Liver 100 
Lung 
Ovary 

Prostate 1000 

20 Small Intestine 100 

Stomach 100 

Testis 10 

Uterus 100 

Relative levels of expression in Table 12 show that normal 
25 colon, ovary, and prostate show high expression of 

SqlngOBO. Moderate levels of expression are apparent in 
breast, endometrium, kidney, liver, lung, small intestine, 
stomach and uterus. Low levels of expression are apparent 
in normal testis 
3 0 Table 13 shows absolute numbers which are relative 

levels of expression of Sqlng050 in 12 cancer samples from 
12 different tissues. Using Polymerase Chain Reaction 
(PCR) technology expression levels were analyzed from four 
lOx serial cDNA dilutions in duplicate. Relative 
35 expression levels of 0, 1, 10, 100 and 1000 are used to 

evaluate gene expression. A positive reaction in the most 
dilute sample indicates the highest relative expression 
value . 
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Table 13: 



10 



lis sue 


Cancer 


bladder 




breast 


inn 

X U \J 


colon 


inn 

1 u u 


kidney 


10 


liver 


100 


lung 


100 


ovary 


100 


pancreas 


100 


prostate 


100 


stomach 


100 


testes 


100 


uterus 


1000 



15 Relative levels of expression in Table 13 show that 

Sqlng050 is expressed in low to moderate levels in 11 out 
of 12 different tissue carcinomas. Sqlng050 is only 
expressed in high level in uterus carcinoma. 

Table 14 shows absolute numbers which are relative 

2 0 levels of expression of Sqlng050 in 6 lung cancer matching 

samples. A matching pair is formed by mRNA from the cancer 
sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same 
individual. Using Polymerase Chain Reaction (PCR) 
25 technology expression levels were analyzed from four lOx 
serial cDNA dilutions in duplicate. Relative expression 
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene 
expression. A positive reaction in the most dilute sample 
indicates the highest relative expression value. 

3 0 Table 14: 

Sample ID Tissue Cancer NAT 

9702C115RB lung 100 100 

9502C032 lung 1000 1000 

8 8 94A lung 10 0 1 

3 5 97 04C0 6 0RA lung 10 0 10 

11145B lung 100 1000 

95 02C10 9R lung 10 0 10 



Relative levels of expression in Table 14 show that 
Sqlng050 is expressed in higher levels in three of the six 
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lung cancer samples compared to the expression level in the 
matching normal adjacent tissue (NAT) . 

Relative Quantitation of Gene Expression 

Real-Time quantitative PCR with fluorescent Taqman 
5 probes is a quantitation detection system utilizing the 5'- 
3- nuclease activity of Taq DNA polymerase. The method 
uses an internal fluorescent oligonucleotide probe (Taqman) 
labeled with a 5' reporter dye and a downstream, 3' 
quencher dye. During PCR, the 5' -3' nuclease activity of 
10 Taq DNA polymerase releases the reporter, whose 

fluorescence can then be detected by the laser detector of 
the Model 7700 Sequence Detection System (PE Applied 
Biosystems, Foster City, CA, USA) . 

Amplification of an endogenous control is used to 
15 standardize the amount of sample RNA added to the reaction 
and normalize for Reverse Transcriptase (RT) efficiency. 
Either cyclophilin, glyceraldehyde- 3 -phosphate 
dehydrogenase (GAPDH) or 18S ribosomal RNA (rRNA) is used 
as this endogenous control. To calculate relative 
20 quantitation between all the samples studied, the target 
RNA levels for one sample were used as the basis for 
comparative results (calibrator) . Quantitation relative to 
the "calibrator" can be obtained using the standard curve 
method or the comparative method (User Bulletin #2: ABI 
25 PRISM 7700 Sequence Detection System) . 

The tissue distribution and the level of the target 
gene were examined for every example in normal and cancer 
tissue. Total RNA was extracted from normal tissues, 
cancer tissues, and from cancers and the corresponding 
30 matched adjacent tissues. Subsequently, first strand cDNA 
was prepared with reverse transcriptase and the polymerase 
chain reaction was done using primers and Taqman probe 
specific to each target gene. The results are analyzed 
using the ABI PRISM 7700 Sequence Detector. The absolute 
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numbers are relative levels of expression of the target 
gene in a particular tissue compared to the calibrator 
tissue . 

Example 1 - SEQ ID NO: 3 

5 Table 15 shows absolute numbers which are relative 

levels of expression of the LSG of SEQ ID NO: 3 in 24 normal 
different tissues. All the values are compared to normal 
small intestine (calibrator) . These RNA samples are 
commercially available pools, originated by pooling samples 
10 of a particular tissue from different individuals. 
Table 15: 

Tissue NORMAL 

Adrenal Gland 0.56 

Bladder 0.03 
15 Brain 2 . 57 

Cervix 0.42 

Colon 0.3 3 

Endometrium 5.12 

Esophagus 0.06 
20 Heart 0 . 08 

Kidney 1 • 2 

Liver 1.38 

Lung 5.54 

Mammary Gland 3.96 
25 Muscle 0.44 

Ovary 1.2 9 

Pancreas 7 . 94 

Prostate 5.21 

Rectum 1.3 6 

3 0 Small Intestine 1 

Spleen 36.89 
Stomach 2 . 8 

Testis 10 . 16 

Thymus 179.15 
35 Trachea 3 . 08 

Uterus 1.04 
0=negative 

The relative levels of expression in Table 15 show 
that mRNA expression of the LSG of SEQ ID NO : 3 is very high 

4 0 in thymus (17 9.15) compared with all the other normal 

tissues analyzed. The expression level of the LSG of SEQ 
ID NO: 3 is moderate in normal lung. Small intestine, the 
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calibrator, has a relative expression level of 1. These 
results demonstrated that mRNA expression of the LSG of SEQ 
ID N0:3 is relatively specific for lung. 

The absolute numbers in Table 15 were obtained 
5 analyzing pools of samples of a particular tissue from 
different individuals. They can not be compared to the 
absolute numbers originated from RNA obtained from tissue 
samples of a single individual in Table 16. 

Table 16 shows absolute numbers which are relative 

10 levels of expression of the LSG of SEQ ID NO : 3 in 79 pairs 
of matching samples and 2 normal blood samples. All the 
values are compared to normal small intestine (calibrator) . 
A matching pair is formed by mRNA from the cancer sample 
for a particular tissue and mRNA from the normal adjacent 

15 sample for that same tissue from the same individual. 
Table 16: 

Ti s sue NORMAL 



40 



Sample 
ID 



Cancer 
Type 



Adenocarcinoma 
Adenocarcinoma 
Adenocarcinoma 
Adenocarc inoma 
Adenocarcinoma 
Adenocarc inoma 
Adenocarcinoma 
Adenocarcinoma 
Adenocarcinoma 
Adenocarcinoma 
Adenocarc inoma 
Adenocarcinoma 
Adenocarcinoma 
Bronchio- alveolar 
carcinoma 
3 5 LngBA6 4 1 Bronchio-alveolar 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
S quamou s cell 
5 0 carcinoma 



2 0 Lng6 0L 
Lngl4 3L 
Lng6 0XL 
LngAC82 
LngAC8 8 

2 5 LngAC6 6 

LngAC6 9 
LngACll 
LngAC3 2 
LngAC94 

3 0 LngAC9 0 

Lng22 3L 
LngAC3 9 
LngBR2 6 



LngSQ4 5 
LngSQ14 
LngSQ9X 
LngSQ56 
4 5 LngSQ8 0 
LngSQ32 
LngSQ16 



Lung 1 
Lung 2 
Lung 3 
Lung 4 
Lung 5 
Lung 6 
Lung 7 
Lung 8 
Lung 9 
Lung 1 0 
Lung 1 1 
Lung 1 2 
Lung 1 3 
Lung 14 

Lung 15 

Lung 1 6 

Lung 17 

Lung 1 8 

Lung 19 

Lung 2 0 

Lung 21 

Lung 2 2 



CANCER 

1 .32 
9.29 
41 . 5 
60 . 97 
50 .21 

1 .42 
2.3 

2 .41 

3 . 9 
2 .65 
16 . 85 
1 .48 
139 . 1 
41 .79 

37 . 14 

4 . 92 
7 . 06 

38 . 32 
55.72 
34 .42 
69 . 55 
1.7 



MATCHING 
NORMAL 
ADJACENT 

0 . 95 
0 . 96 
13 . 18 
2 . 04 
31 .89 
0 . 72 
0 . 73 
1.95 
0 .69 
0 . 77 
0 . 57 

0 . 06 

1 . 52 
8 . 57 

16 

4.01 
15 . 19 
1 . 78 
33 . 01 
4 . 3 
21 . 86 
0 . 22 
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LnaS07 9 


S quamou s cell 


Lung 


23 








carcinoma 










Lng4 7XQ 


Squamous cell 


Lung 


24 








carcinoma 








> 


T,naRR94 


Squamous cell 


Lung 


25 








carcinoma 










T,naP2 OX 


Squamous cell 


Lung 


26 








carcinoma 










T.na c ?044 


S quamou s cell 


Lung 


27 




10 




carcinoma 










T.ncr90X 


Squamous eel 1 


Lung 


28 








carcinoma 










T.naS04 3 


S qu amou s cell 


Lung 


29 








carcinoma 








j — ' 


TinaLC71 


Large cell 


Lung 


30 








carcinoma 










LngLC10 9 


Large cell 


Lung 


31 








carcinoma 










LngLC8 0 


Large cell 


Lung 


32 




20 




carcinoma 










T.na77L 


Large cell 


Lung 


33 








carcinoma 








Lng315L 


Lung carcinoma 


Lung 


34 






Lng528L 


Lung c a r c i noma 


Lung 


35 




25 


Lng7 5XC 


Metastatic from 


Lung 


36 








Osteogenic Sarcoma 




37 






LngMT67 


Metastatic from 


Lung 








renal cell cancer 










LngMT71 


Metastatic from 


Lung 


38 




30 




melanoma 
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35 



40 



45 



50 



55 



60 



Bld46XK 

BldTR14 

B5 

B6 

CvxKS52 

CvxKS 8 3 

ClnAS43 

ClnAS45 

ClnAS46 

Cln AS67 

Cln AS89 

Endo2 8XA 

Endol0479 

Endo6 8X 

KidlOXD 

Kidl09XD 

LivlSXA 

Livl74L 

Mam3 5 5 

Maml7 3M 

Mam2 2 0 

Mam97 6M 

Ovrl80B 

OvrA084 

Pan77X 

Pan92X 

ProlOlXB 

Prol09XB 

Prol25XB 

Prol3XB 

Skn3 9A 

Skn816S 

SmInt21XA 



Bladder 1 
Bladder 2 
Blood 1 
Blood 2 
Cervix 1 
Cervix 2 
Colon 1 
Colon 2 
Colon 3 
Colon 4 
Colon 5 
Endometrium 
Endometrium 
Endometrium 
Kidney 1 
Kidney 2 
Liver 1 
Liver 2 
Mammary 1 
Mammary 2 
Mammary 3 
Mammary 4 
Ovary 1 
Ovary 2 
Pancreas 1 
Pancreas 2 
Prostate 1 
Prostate 2 
Prostate 3 
Prostate 4 
Skin 1 
Skin 2 
Sm. Int. 1 



154 . 34 
177 . 91 



4 . 71 


3 . 


04 


35.26 


1 . 


42 


138 .62 


0 . 


19 


3 . 05 


0 . 


18 


7 . 06 


3 . 


97 


1 .49 


0 . 


66 


97 . 01 


1 . 


71 


27 . 86 


16 .22 


102 . 89 


20.25 


34 . 66 


10 . 13 


1 . 03 


9 


.22 


36.25 
21.48 
3 .53 


50.39 
6.54 
4 . 55 


8.2 


3 


. 97 


13 . 93 


19 .23 


0 

1 . 57 


0 
0 


. 78 



11 . 96 
92 . 09 
4 . 03 
0 .28 
0.38 
0 . 62 
0 . 09 
15 .51 
24 

13 . 13 
3 . 07 
8 . 22 
0 . 17 
0 . 15 
2 .63 
6 . 87 
0 .29 

0 . 19 
25 .72 
2.7 

8 . 11 
27 .28 
6 . 99 

1 . 42 

2 . 24 
0 .41 

3 . 71 
25 . 81 
4.35 



2 .27 
8 .66 



29 
17 



0 . 
0 . 

0 . 
7 , 



0 . 59 
1.78 
0 . 05 
4 .77 
7 . 14 
14 .42 
2 . 07 
7 . 24 
.09 
. 32 
. 15 
.67 
0 . 87 

0 . 91 

0 

1 . 97 
3.25 
21 . 78 
4 .68 
1 . 16 
1 . 71 
1 . 59 
0.35 

0 . 34 

1 . 17 
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13 


. 93 


3 


16 


4 . 


59 


5 


17 


6 . 


39 


4 


16 


5 . 


01 


0 


46 


6 . 


39 


4 


58 


0 . 


86 


1 


55 


2 . 


49 


0 


.43 


9 . 


16 


3 


.89 


0 . 


34 


0 


.43 


2 . 


51 


0 


.63 
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StollSS Stomach 1 

Sto264S Stomach 2 

Sto288S Stomach 3 

5 Thr270T Thyroid 1 

Thr93 9T Thyroid 2 

Tst647T Testis 1 

Tst663T Testis 2 

Utrl35XO Uterus 1 

10 Utrl41XO Uterus 2 

In the analysis of matching samples, the higher levels of 
expression were in lung showing a high degree of tissue 
specificity for lung tissue. These results confirm the 
tissue specificity results obtained with normal pooled 
15 samples (Table 15) . 

Furthermore, the levels of mRNA expression in cancer 
samples and the isogenic normal adjacent tissue from the 
same individual were compared. This comparison provides an 
indication of specificity for the cancer stage (e.g. higher 

2 0 levels of mRNA expression in the cancer sample compared to 

the normal adjacent) . Table 16 shows overexpression of the 
LSG of SEQ ID NO: 3 in 2 6 lung cancer tissues compared with 
their respective normal adjacent (lung samples #2, 3, 4, 5, 
6, 7, 9, 10, 11, 12, 14, 15, 18, 19, 20, 21, 22, 23, 24, 
25 25, 26, 27, 29, 30, 31, and 32). There is overexpression 
in the cancer tissue for 68% of the lung matching samples 
tested (total of 38 lung matching samples) . 

Altogether, the relative high level of lung tissue 
specificity, plus the mRNA overexpression in 68% of the 

3 0 lung carcinoma matching samples tested are believed to make 

the LSG of SEQ ID NO : 3 a good diagnostic marker for lung 
cancer . 

Primers used for expression analysis are: 
Forward 

3 5 5' AGCCATTGCCATCCCAGT 3' (SEQ ID NO: 31) 
Reverse 

5' ATGTTCTTCACGCTCTTCGC 3' (SEQ ID NO: 32) 
Probe 
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5' AGGAAGTGCTGGAAGAGGCTGGCT 3' (SEQ ID NO: 33) 

Example 2 - SEQ ID NO: 15 

Table 17 shows absolute numbers which are relative 
levels of expression of the LSG of SEQ ID NO: 15 in 24 
5 normal different tissues. All the values are compared to 
normal brain (calibrator) . These RNA samples are 
commercially available pools, originated by pooling samples 
of a particular tissue from different individuals. 
Table 17: 





10 


Tissue 


NORMAL 






Adrenal Gland 


67 . 65 






Bladder 


39.67 


'SWT 




Brain 


1 . 00 






Cervix 


677 . 93 




15 


Colon 


1287 . 18 






Endometrium 


162 . 58 






Esophagus 


1034 . 70 






Heart 


4 . 81 






Kidney 


25 . 02 




20 


Liver 


194 . 01 






Lung 


4705 . 07 






Mammary Gland 


840 . 44 






Muscle 


12 . 91 






Ovary 


608 . 87 




25 


Pancreas 


20.89 






Prostate 


858.10 






Rectum 


4435 . 87 






Small Intestine 


2149 . 82 






Spleen 


5595 .30 




30 


Stomach 


14115 . 57 






Testis 


64 .67 






Thymus 


2187 .40 






Trachea 


2866 .35 






Uterus 


193 . 34 




35 


0=negative 





The relative levels of expression in Table 17 show that 
mRNA expression of the LSG of SEQ ID NO: 15 is very high in 
stomach (14115.57) compared with all the other normal 
tissues analyzed. Expression levels of this LSG are 
4 0 moderate in normal lung (47 05.07) Brain, the calibrator, 
has a relative expression level of 1. These results 
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demonstrate that mRNA expression of the LSG of SEQ ID NO: 15 
is relatively specific for lung. 

The absolute numbers in Table 17 were obtained 
analyzing pools of samples of a particular tissue from 
5 different individuals. They can not be compared to the 
absolute numbers originated from RNA obtained from tissue 
samples of a single individual in Table 18. 
Table 18: 
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carcinoma 






Lng90X 


Squamous cell 
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carcinoma 






LngSQ4 3 


Squamous cell 








carcinoma 






LngLC71 


Large cell 








carcinoma 




40 


LngLC10 9 


Large cell 








carcinoma 






LngMT71 


Metastatic from 








melanoma 






Bld46XK 






45 


BldTR14 








CvxKS 5 2 








ClnAS43 








ClnAS4 5 








ClnAS46 
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ClnAS67 








ClnAS89 








Endo2 8XA 








KidlOXD 








LivlSXA 






55 


Mam3 5 5 








Pan77X 





T i s sue NORMAL 


CANCER 


MATCHING 

NORMAL 

ADJACENT 


Lung 1 
Lung 2 
Lung 3 
Lung 4 
Lung 5 
Lung 6 
Lung 7 

T unn Q 

i_iUiiy o 
Lung 9 


18561 .17 
28 . 54 
16555 .24 
18116 .29 
4389 . 98 
18179 . 19 
10623 .71 
8393 . 17 
13 . 98 


5732 . 70 
1 . 57 
3408 . 69 
1891 . 09 
5732 . 70 
10015 . 87 
309 . 76 
491 . 14 
20 .68 


Lung 1 0 


34 . 78 


10.13 


Lung 11 


9184 . 59 


8995 . 58 


Lung 1 2 


2 . 82 


32 . 11 


Lung 1 3 


68 . 12 


4 . 07 


Lung 14 


3373 . 43 


86 . 22 


Lung 1 5 


19215 .37 


81245 . 48 


Lung 1 6 


5 . 19 


1 . 14 


Lung 17 


24 . 17 


2 . 12 


Lung 1 8 


67 .42 


25.37 


Lung 19 


12 .38 


3 . 96 


Lung 2 0 


13 . 00 


9 .45 


Bladder 1 
Bladder 2 
Cervix 1 
Colon 1 
Colon 2 
Colon 3 
Colon 4 
Colon 5 
Endometrium 1 
Kidney 1 
Liver 1 
Mammary 1 
Pancreas 1 


131 . 60 
8306 .36 
24 .85 
1590 .21 
1458 .23 
2418 . 67 
365 . 82 
2304 .12 
10 . 70 
0 .38 
19.16 
16 . 56 
0 .15 


5 . 90 
7009 . 03 
8 . 91 
8335 . 19 
1820 .35 
3019 . 30 
823 .14 
75.32 
0.49 
0 .21 
115 . 76 
0 . 18 
0 . 07 



• 



# 
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ProlOlXB 

Skn816S 

SmInt21XA 

Sto288S 

Thr270T 

Tst647T 

Utrl35XO 

0=negat ive 



- 114 - 

Prostate 1 
Skin 1 
Sm. Int. 1 
Stomach 1 
Thyroid 1 
Testis 1 
Uterus 1 



.46 
.28 
.43 
.41 
. 99 
1217 . 75 
237.21 
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1.05 
0 . 10 
12 . 04 

14 . 32 
0 . 14 

15 . 62 
55 . 14 
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In the analysis of matching samples, the level of mRNA 
expression in cancer samples and the isogenic normal 
adjacent tissue from the same individual were compared. 
This comparison provides an indication of specificity for 
the cancer stage (e.g. higher levels of mRNA expression in 
the cancer sample compared to the normal adjacent) . Table 
15 18 shows overexpression of the LSG of SEQ ID NO: 15 in 14 
lung cancer tissues compared with their respective normal 
adjacent (lung samples #1, 2, 3, 4, 6, 7, 8, 10, 13, 14, 
16,17,18, and 19). There is overexpression in the cancer 
tissue for 70% of the lung matching samples tested (total 
20 of 20 lung matching samples) . 

Altogether, the relative high level of lung tissue 
specificity, plus the mRNA overexpression in 70% of the 
lung carcinoma matching samples tested are believed to make 
the LSG of SEQ DI NO: 15 a good diagnostic marker for lung 
25 cancer. 

Primers used for expression analysis in this example 
are as follows: 
Forward 

5' AAGGGAGCACCGTGGAGAA 3' (SEQ ID NO: 34) 
3 0 Reverse 

5' AGGGCTGGATGACTTGGGA 3' (SEQ ID NO: 35) 
Probe 

5' TTCCCAACTCTAACCCCACCCACG 3' (SEQ ID NO : 3 6 ) 



